Transgenic expression constructs for vegetative plant tissue specific expression
of nucleic acids

FIELD OF THE INVENTION 

The invention relates to transgenic expression constructs and vectors comprising plant
promoters with a non-seed tissue, preferably vegetative plant tissue specific expression
profile, and the use of these transgenic expression constructs or vectors for the
transgenic expression of nucleic acid sequences in plants. The promoters of the invention
demonstrate strong expression levels in most vegetative organs and tissues at different
developmental stages (including but not limited to leaves, stems and roots), but low levels
of expression in flowers (including the reproductive organs) and very low expression
levels in seeds. The invention furthermore relates to transgenic plants and plant cells
transformed with these expression constructs or vectors, to cultures, parts or
propagation material derived therefrom, and to the use of same for the preparation of
foodstuffs, animal feeds, seed, pharmaceuticals or fine chemicals, to improve plant
biomass or yield, or to provide desirable phenotypes. Strong expression controlled by these
promoters in young seedlings and cultured cells provides an appropriate tool to express
selectable marker genes for plant transformation.

BACKGROUND OF THE INVENTION 

The aim of plant biotechnology is the generation of plants with advantageous novel
properties, such as pest and disease resistance, resistance to environmental stress
(e.g., water-logging, drought, heat, cold, light-intensity, day-length, chemicals, etc.),
improved qualities (e.g., high yield of fruit, extended shelf-life, uniform fruit shape and
color, higher sugar content, higher vitamins C and A content, lower acidity, etc.), or for
the production of certain chemicals or pharmaceuticals (Dunwell 2000). Furthermore,
resistance against abiotic stress (drought, salt) and/or biotic stress (insects, fungal,
nematode infections) can be increased. Crop yield enhancement and yield stability can
be achieved by developing genetically engineered plants with desired phenotypes (Alia
1999; Sakamoto 1998). Appropriate promoters play an important role in regulating
genes of interest to obtain the desired phenotypes.

A basic prerequisite for the recombinant expression of specific genes in plants is the
provision of plant-specific promoters. A variety of plant promoters are known. Known
examples are constitutive promoters such as the nopaline synthase promoter from
Agrobacterium, the promoter of the cauliflower mosaic virus (CaMV) 35S transcript
(Odell 1985), the OCS (octopine synthase) promoter from Agrobacterium, the ubiquitin
promoter (Callis 1990), the promoters of the vacuolar ATPase subunits or the promoter
of proline-rich protein from wheat (WO 91/13991). The disadvantage of these
promoters is that they are constitutively active in virtually all of the plant's tissues. A targeted
expression of genes in specific plant parts or at specific developmental stages is not
possible with these promoters.

Promoters with specificities for the anthers, ovaries, flowers, leaves, stems, roots and
seeds have been described. The stringency of the specificity and the expression
activity of these promoters differ greatly. Promoters which must be mentioned are those
which ensure a leaf-specific expression, such as the potato cytosolic FBPase promoter
(WO 97/05900), the Rubisco (ribulose-1,5-bisphosphate carboxylase) SSU (small
subunit) promoter, or the potato ST-LSI promoter (Stockhaus 1989).

Examples of other promoters are promoters with specificity for tubers, storage roots or 
roots, such as, for example, the class I patatin promoter (B33), the potato cathepsin D 
inhibitor promoter, the starch synthase (GBSS1) promoter or the sporamin promoter, 
fruit-specific promoters such as, for example, the tomato fruit-specific promoter (EP- 
A1 409 625), fruit-maturation-specific promoters such as, for example, the tomato fruit- 
maturation specific promoter (WO 94/21794), flower-specific promoters such as, for 
example, the phytoene synthase promoter (WO 92/16635) or the promoter of the P1-rr 
gene (WO 98/22593). 

A promoter which is regulated in a development-dependent fashion is described (Baer- 
son 1993). 

Promoters are described with tissue specificity for the mesophyll and the palisade cells
in leaves (Broglie 1984), the dividing shoot and the root meristem (Atanassova 1992),
pollen (Guerrero 1990), seed endosperm (Stalberg 1993), root epidermis (Suzuki
1993), and for the root meristem, root vascular tissue and root knots (Bogusz 1990).

Other known promoters are those which govern expression in seeds and plant
embryos. Examples of seed-specific promoters are the phaseolin promoter (US 5,504,200,
Bustos 1989), the promoter of the 2S albumin gene (Joseffson 1987), the legumin
promoter (Shirsat 1989), the USP (unknown seed protein) promoter (Baumlein 1991), the
promoter of the napin gene (Stalberg 1996), the promoter of the sucrose binding
protein (WO 00/26388) or the LeB4 promoter (Baumlein 1991). These promoters govern a
seed-specific expression of storage proteins.

Described is the promoter of the salt-inducible MsPRP2 gene from alfalfa (Bastola 
1998; WO 99/53016). This promoter is described to be highly root specific. 

Seeds are the most relevant agronomical product and are heavily used for feed and
food purposes. However, expression of transgenes in seeds is in most cases neither
necessary nor beneficial. For example, traits like herbicide resistance, resistance
against insects, fungi, or nematodes, and cold or drought resistance do not need to be
expressed in seeds, since expression is only required in roots or green tissues.
Expression in seeds can have one or more of the following disadvantages:

1. Unnecessary expression of traits in seeds may lead to lower germination rates or at 
least unnecessary consumption of transcription / translation capacity resulting in 
yield loss or negatively affecting composition of the seed. 

2. Unnecessary expression of traits in seeds may raise higher hurdles in de-regulation 
proceedings (since a more substantial amount of the transgenic product is com- 
prised in the feed or food materials). 

3. Unnecessary expression of traits in seeds may negatively affect consumer accep- 
tance. 

Flowers comprise the plant's reproductive organs (carpels and stamens). Expression in
these tissues is for some traits also regarded as disadvantageous. For example,
expression of the Bt protein (conferring resistance against corn root borer and other plant
parasites) under a strong constitutive promoter resulted in expression in pollen and
was discussed to have a toxic effect on beneficial pollen transferring insects like the
monarch butterflies.

It is, however, an unmet demand in the plant biotech field to establish reliable
expression systems which express traits only in the vegetative plant tissues but not (or
much less) in seeds and flowers (or their reproductive organs). As described above,
there are numerous tissue specific promoters known in the art. However, in cases where they
have no or a low seed and/or flower expression capacity, they are highly specific for
other tissues (e.g., leaves or roots), but do not allow for a broad expression profile
in all vegetative plant tissues.

It is therefore an objective of the present invention to provide promoter sequences
which demonstrate a constitutive expression activity in all (or substantially all) non-seed
tissues, preferably vegetative plant tissues and/or organs, but have only a low
(preferably no) expression activity in seeds and preferably also in flowers.

This objective is achieved by the promoter sequences provided within this invention. 

A first subject matter of the invention therefore relates to transgenic expression
constructs for predominant expression of a nucleic acid sequence of interest in
substantially all vegetative plant tissues comprising a promoter sequence selected from the
group consisting of

a) the promoter of the Pisum sativum ptxA gene, functional equivalent fragments and
functional equivalent homologs thereof, or their complements, having essentially the
same promoter activity as the promoter of the Pisum sativum ptxA gene, and

b) the promoter of the Glycine max extensin (SbHRGP3) gene, functional equivalent
fragments and functional equivalent homologs thereof, or their complements, having
essentially the same promoter activity as the promoter of the Glycine max extensin
(SbHRGP3) gene,

wherein said promoter sequence is operably linked to a nucleic acid sequence of inter- 
est to be transgenically expressed, and wherein said promoter sequence is heterolo- 
gous with respect to said nucleic acid sequence of interest. 

The promoter sequences of the ptxA or SbHRGP3 gene demonstrate highly uniform,
homogeneous expression activity in virtually all vegetative organs and/or tissues of
various species including dicotyledonous and monocotyledonous plants. In seeds, there is
no expression activity detectable by GUS staining (see Example 7 and Figs. 3, 4 and 5)
and low expression activity detectable by the more sensitive method of RT-PCR (see
Example 16 and Table 2). This is an advantage since very little, if any, transgenic
protein will be expressed in the seed (which is used for food and feed purposes). For
numerous agronomically valuable traits (e.g., stress resistance, improved water use,
resistance against fungi or insects, etc.) no or low expression in seeds is required.
Therefore, avoidance of this unnecessary expression may facilitate regulatory approval
and/or consumer acceptance.



BASF Plant Science GmbH 



20040055 



PF 55368-2 US 



Furthermore, the promoter activity in the vegetative plant tissues and organs at the
vegetative stages is relatively stronger than at the reproductive stages. In consequence,
the promoter is most active in the young, vulnerable plantlet, but becomes less active
in the mature plant. This is an additional advantage, especially for genes which
confer resistance against biotic or abiotic stress factors (e.g., cold, drought, insect damage,
etc.) since young, developing plants are considered much more vulnerable to said
stress factors than mature plants. The promoter activity of the promoters of the
invention is especially high in non-differentiated or de-differentiated tissues or cells like, e.g.,
callus culture. This is very useful for utilizing the promoter in combination with a selection
marker in transformation protocols.

The invention furthermore relates to a method for transgenic predominant expression 
of a nucleic acid sequence of interest in substantially all vegetative plant tissues com- 
prising: 

i. introduction of a transgenic expression construct into a plant cell or a plant, said 
transgenic expression construct comprising a promoter sequence selected from the 
group consisting of 

a) the promoter of the Pisum sativum ptxA gene, functional equivalent fragments 
and functional equivalent homologs thereof, or their complements, having essen- 
tially the same promoter activity as the promoter of the Pisum sativum ptxA gene 
and 

b) the promoter of the Glycine max extensin (SbHRGP3) gene, functional equivalent 
fragments and functional equivalent homologs thereof, or their complements, 
having essentially the same promoter activity as the promoter of the Glycine max 
extensin (SbHRGP3) gene,

wherein said promoter sequence is operably linked to a nucleic acid sequence of in- 
terest to be transgenically expressed, and wherein said promoter sequence is het- 
erologous with respect to said nucleic acid sequence of interest, 
under conditions such that said nucleic acid sequence of interest is expressed in 
said plant cell and/or predominantly expressed in the vegetative plant tissue and/or 
organs of said transgenic plant. 

In a preferred embodiment, the method further comprises ii) identifying or selecting the
transgenic plant cell comprising said transgenic expression construct. In another
preferred embodiment, the method further comprises iii) regenerating transgenic plant
tissue from the transgenic plant cell. In an alternative preferred embodiment, the
method further comprises iv) regenerating a transgenic plant from the transgenic plant cell.

Preferably, the promoter sequence utilized in the inventive transgenic expression
constructs or methods of the invention is selected from the group of sequences consisting
of:

a) the promoter of the Pisum sativum ptxA gene as described by SEQ ID NO: 1 or its
complement,

b) a functional equivalent fragment of at least 50 consecutive base pairs of the
promoter sequence described by SEQ ID NO: 1, or its complement, having essentially
the same promoter activity as the promoter sequence described by SEQ ID NO: 1,

c) a functional equivalent homolog of the promoter sequence described by SEQ ID
NO: 1 which has essentially the same promoter activity as the promoter sequence
described by SEQ ID NO: 1, and

i) has a homology of at least 95% over a sequence of at least 100 consecutive base
pairs to the sequence as described by SEQ ID NO: 1 and/or

ii) hybridizes under high stringency conditions with a fragment of at least 50
consecutive base pairs of the nucleic acid molecule described by SEQ ID NO: 1.
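
By way of illustration only, the homology criterion of at least 95% over at least 100 consecutive base pairs can be sketched as a sliding-window comparison. The sketch below is an assumption-laden simplification: it counts ungapped, position-by-position identity, whereas homology determinations in practice commonly rely on gapped alignment programs. The function names `best_window_identity` and `meets_homology_threshold` are hypothetical and are not taken from this specification.

```python
def best_window_identity(reference: str, candidate: str, window: int = 100) -> float:
    """Highest ungapped percent identity between any `window`-long stretch of
    `reference` and any equally long stretch of `candidate`.

    Simplification: gaps are not modelled, so this is only a rough stand-in
    for an alignment-based homology calculation.
    """
    reference, candidate = reference.upper(), candidate.upper()
    if len(reference) < window or len(candidate) < window:
        raise ValueError("both sequences must be at least `window` bases long")
    best = 0.0
    for i in range(len(reference) - window + 1):
        ref_win = reference[i:i + window]
        for j in range(len(candidate) - window + 1):
            cand_win = candidate[j:j + window]
            # Count positions where both windows carry the same base.
            matches = sum(a == b for a, b in zip(ref_win, cand_win))
            best = max(best, 100.0 * matches / window)
    return best


def meets_homology_threshold(reference: str, candidate: str,
                             min_identity: float = 95.0, window: int = 100) -> bool:
    """True if some window pair reaches the minimum percent identity."""
    return best_window_identity(reference, candidate, window) >= min_identity
```

The brute-force comparison of all window pairs is chosen for readability; for sequences much longer than a typical promoter fragment, a dedicated alignment tool would be the more practical choice.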

A preferred functional equivalent fragment of the ptxA promoter comprises a sequence 
from about base pair 300 to about base pair 583 of the sequence described by SEQ ID 
NO: 1. Another preferred functional equivalent homolog of the ptxA promoter com- 
prises a sequence from about base pair 300 to about base pair 828 of the sequence 
described by SEQ ID NO: 1. 

In another preferred embodiment, the promoter sequence utilized in the inventive 
transgenic expression constructs or methods of the invention is selected from the 
group of sequences consisting of: 

a) the promoter of the Glycine max extensin (SbHRGP3) gene as described by SEQ ID 
NO: 2, or its complement, 

b) a functional equivalent fragment of at least 50 consecutive base pairs of the pro- 
moter sequence described by SEQ ID NO: 2, or its complement, having essentially 
the same promoter activity as the promoter sequence described by SEQ ID NO: 2, 

c) a functional equivalent homolog of the promoter sequence described by SEQ ID 
NO: 2 which has essentially the same promoter activity as the promoter sequence 
described by SEQ ID NO: 2, and has 

i) a homology of at least 60% over a sequence of at least 100 consecutive base 
pairs to the sequence as described by SEQ ID NO: 2 and/or 

ii) hybridizes under high stringency conditions with a fragment of at least 50
consecutive base pairs of the nucleic acid molecule described by SEQ ID NO: 2.

A preferred functional equivalent fragment of the SbHRGP3 promoter comprises a
sequence from about base pair 800 to about base pair 1179 of the sequence described
by SEQ ID NO: 2.

Other preferred functional equivalent homologs of the SbHRGP3 promoter comprise a 
sequence selected from the group described by SEQ ID NO: 7, 8 and 9. 

The transgenic expression construct of the invention may comprise further genetic
control sequences operably linked to the nucleic acid sequence of interest to be
expressed, and/or additional functional elements.

The nucleic acid sequence of interest transgenically expressed from the transgenic
expression construct of the invention may result in expression of a protein encoded by
said nucleic acid sequence (by transcription and subsequent translation), and/or
expression of sense, antisense or double-stranded RNA encoded by said nucleic acid
sequence of interest.

In another embodiment, the nucleotide sequence encoding the transgenic expression
construct of the invention is double-stranded. In yet another embodiment, the nucleotide
sequence encoding the transgenic expression construct of the invention is
single-stranded.

In yet another alternative embodiment, the transgenic expression construct of the
invention is contained in a vector or in a non-human organism, preferably a plant cell or a
plant. In a preferred embodiment, the plant cell is derived from a dicotyledonous or
monocotyledonous plant. In a yet more preferred embodiment, the monocotyledonous
plant is selected from the group consisting of sugarcane, maize, sorghum, pineapple,
rice, barley, oat, wheat, rye, yam, onion, banana, coconut, date, and hop. In a yet more
preferred embodiment, the dicotyledonous plant is selected from the group consisting
of rapeseed, tobacco, tomato, tagetes (marigold), soybean, pea, common bean, and
papaya.

Further embodiments of the invention relate to the use of a transgenic organism of the
invention or of cell cultures, parts or transgenic propagation material derived therefrom
for the production of foodstuffs, animal feeds, seed, pharmaceuticals or fine chemicals.

Another embodiment of the invention relates to a method for production of a foodstuff,
animal feed, seed, pharmaceutical or fine chemical employing a transgenic organism of
the invention or of cell cultures, parts or transgenic propagation material derived
therefrom.

BRIEF DESCRIPTION OF THE DRAWINGS

Fig. 1 Map of ptxA::GUS chimeric construct. 

The plasmid comprises an expression construct containing a ptxA promoter
(ptxA) operably linked to a β-glucuronidase gene (gusINT), and 3' untranslated
region and termination derived from the nopaline synthase gene (Nos).

Fig. 2 Map of SbHRGP3::GUS chimeric construct. 

The plasmid comprises an expression construct containing SbHRGP3 promoter
(p(gm)SbHRGP3) operably linked to a β-glucuronidase gene (GUS), and 3'
untranslated region and termination derived from the nopaline synthase gene
(Nos).

Fig. 3 GUS expression controlled by pea ptxA promoter in Arabidopsis. The upper
panel (I) represents the original photos with the GUS staining, while the lower
panel (II) indicates areas distinctly stained blue by overlaid shaded areas.
(A) seedlings (14 Days After Germination, DAG),
(B) rosette leaf (25 DAG),
(C) leaf from mature plants (35 DAG),
(D) leaf from old plants (>40 DAG),
(E) flowers and siliques from high expression line,
(F) flowers and siliques from low expression line, and
(G) crushed dried seeds in X-Gluc solution.
Pictures represent reproducible expression patterns from 30 T1 lines and 10 T2
lines with low copy number.

Fig. 4 GUS expression controlled by pea ptxA promoter in Canola. The upper panel (I)
represents the original photos with the GUS staining, while the lower panel (II)
indicates areas distinctly stained blue by overlaid shaded areas.
(A) seedlings (3-4 Days After Germination),
(B) shoot with leaves from young plants (first 2-3 true leaves),
(C) leaf from mature plants (4-6 weeks after germination),
(D) leaf from old plants at late flowering stage,
(E) flower,
(F) style after pollination,
(G) mature seeds.



Fig. 5 GUS expression controlled by SbHRGP3 promoter in Arabidopsis. The upper
panel (I) represents the original photos with the GUS staining, while the lower
panel (II) indicates areas distinctly stained blue by overlaid shaded areas.
(A) seedlings (14 Days After Germination, DAG),
(B) rosette leaf (25 DAG),
(C) leaf from old plants (>40 DAG),
(D) flowers and siliques. Only very slight expression could be detected in
reproductive organs.
Pictures represent reproducible expression patterns from 30 T1 lines and 10 T2
lines with low copy number.



Fig. 6a+b: Protein alignment of the ptxA protein with the MsPRP2 protein from
Medicago sativa and other similar proteins.
A: ptxA protein, GenBank Acc.-No.: X67427
B: Medicago sativa proline-rich cell wall protein, GenBank Acc.-No.: AF028841
C: Lycopersicum esculentum proline rich protein, GenBank Acc.-No.: X57076
D: Vitis vinifera proline-rich protein 1 (PRP1), GenBank Acc.-No.: AY046416
E: Arabidopsis thaliana protease inhibitor/seed storage/lipid transfer
protein (LTP), GenBank Acc.-No.: NM_104929



Fig. 7a+b: Alignment of the promoter regions of ptxA gene (A) and the MsPRP2 gene
from Medicago sativa (B).



Fig. 8a-c: Alignment of the SbHRGP3 promoter variations. 



Fig. 9 Map of ptxA promoter::ZmUbiquitin intron::GUS chimeric construct
(PtxA-ZmUbi intron-GUS). The plasmid comprises an expression construct
containing a ptxA promoter (ptxA) operably linked to maize Ubiquitin intron
(ZmUbi intron), β-glucuronidase gene (gusINT), and 3' untranslated region
and termination derived from the nopaline synthase gene (NOS). SM
cassette stands for a selectable marker cassette.

GENERAL DEFINITIONS

To facilitate understanding of the invention, a number of terms are defined below. 

It is to be understood that this invention is not limited to the particular methodology,
protocols, cell lines, plant species or genera, constructs, and reagents described as
such. It is also to be understood that the terminology used herein is for the purpose of
describing particular embodiments only, and is not intended to limit the scope of the
present invention which will be limited only by the appended claims. It must be noted
that as used herein and in the appended claims, the singular forms "a," "an," and "the"
include plural reference unless the context clearly dictates otherwise. Thus, for
example, reference to "a vector" is a reference to one or more vectors and includes
equivalents thereof known to those skilled in the art, and so forth.

The term "about" is used herein to mean approximately, roughly, around, or in the re- 
gion of. When the term "about" is used in conjunction with a numerical range, it modi- 
fies that range by extending the boundaries above and below the numerical values set 
forth. In general, the term "about" is used herein to modify a numerical value above and 
below the stated value by a variance of 20 percent up or down (higher or lower). 
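
As a worked illustration of this definition, the 20 percent variance can be expressed as a short calculation. The helper names `about_range` and `about_interval` are hypothetical and serve only to make the arithmetic explicit; under this reading, "about base pair 300" would span roughly 240 to 360.

```python
def about_range(value: float, variance: float = 0.20) -> tuple:
    """Bounds implied by "about <value>": the stated value minus and plus
    `variance` (20 percent by default, per the definition above)."""
    delta = abs(value) * variance
    return (value - delta, value + delta)


def about_interval(low: float, high: float, variance: float = 0.20) -> tuple:
    """Bounds implied by "about <low> to about <high>": the lower boundary is
    extended downward and the upper boundary upward."""
    return (about_range(low, variance)[0], about_range(high, variance)[1])


# Example use with a single value and with a numerical range.
lower, upper = about_range(300)
range_lower, range_upper = about_interval(300, 583)
```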

As used herein, the word "or" means any one member of a particular list and also in- 
cludes any combination of members of that list. 

The term "nucleic acid" refers to deoxyribonucleotides or ribonucleotides and polymers
or hybrids thereof in either single- or double-stranded, sense or antisense form.

Unless otherwise indicated, a particular nucleic acid sequence also implicitly encom- 
passes conservatively modified variants thereof (e.g., degenerate codon substitutions) 
and complementary sequences, as well as the sequence explicitly indicated. The term 
"nucleic acid" is used interchangeably herein with "gene", "cDNA", "mRNA",
"oligonucleotide," and "polynucleotide".

The phrase "nucleic acid sequence" as used herein refers to a consecutive list of
abbreviations, letters, characters or words, which represent nucleotides. In one
embodiment, a nucleic acid can be a "probe" which is a relatively short nucleic acid, usually
less than 100 nucleotides in length. Often a nucleic acid probe is from about 10 to about
50 nucleotides in length. A "target region" of a nucleic acid is a
portion of a nucleic acid that is identified to be of interest. A "coding region" of a nucleic
acid is the portion of the nucleic acid which is transcribed and translated in a
sequence-specific manner to produce a particular polypeptide or protein when placed under
the control of appropriate regulatory sequences. The coding region is said to encode
such a polypeptide or protein.

The term "antisense" is understood to mean a nucleic acid having a sequence com- 
plementary to a target sequence, for example a messenger RNA (mRNA) sequence 
the blocking of whose expression is sought to be initiated by hybridization with the tar- 
get sequence. 

The term "sense" is understood to mean a nucleic acid having a sequence which is
homologous or identical to a target sequence, for example a sequence which binds to a
protein transcription factor and which is involved in the expression of a given gene.
According to a preferred embodiment, the nucleic acid comprises a gene of interest
and elements allowing the expression of the said gene of interest.

The term "gene" refers to a coding region operably joined to appropriate regulatory
sequences capable of regulating the expression of the polypeptide in some manner. A
gene includes untranslated regulatory regions of DNA (e.g., promoters, enhancers,
repressors, etc.) preceding (upstream) and following (downstream) the coding region
(open reading frame, ORF) as well as, where applicable, intervening sequences (i.e.,
introns) between individual coding regions (i.e., exons).

As used herein the term "coding region" when used in reference to a structural gene
refers to the nucleotide sequences which encode the amino acids found in the nascent
polypeptide as a result of translation of an mRNA molecule. The coding region is
bounded, in eukaryotes, on the 5'-side by the nucleotide triplet "ATG" which encodes the
initiator methionine and on the 3'-side by one of the three triplets which specify stop
codons (i.e., TAA, TAG, TGA). In addition to containing introns, genomic forms of a
gene may also include sequences located on both the 5'- and 3'-end of the sequences
which are present on the RNA transcript. These sequences are referred to as "flanking"
sequences or regions (these flanking sequences are located 5' or 3' to the
non-translated sequences present on the mRNA transcript). The 5'-flanking region may
contain regulatory sequences such as promoters and enhancers which control or
influence the transcription of the gene. The 3'-flanking region may contain sequences which
direct the termination of transcription, posttranscriptional cleavage and polyadenylation.
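
The boundaries of a coding region as defined above (an "ATG" triplet on the 5'-side and one of TAA, TAG or TGA in the same reading frame on the 3'-side) can be illustrated with a minimal sketch. It assumes an intron-free DNA sequence given 5' to 3' and simply reports the first such region found; the function name `find_coding_region` is hypothetical, and real gene prediction involves considerably more than this.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_coding_region(dna: str):
    """Return (start, end) as 0-based, end-exclusive indices of the first
    ATG-initiated stretch that ends with an in-frame stop codon, or None.

    Assumption: `dna` is an intron-free sequence written 5' to 3'.
    """
    dna = dna.upper()
    start = dna.find("ATG")
    while start != -1:
        # Walk codon by codon in the reading frame set by this ATG.
        for pos in range(start + 3, len(dna) - 2, 3):
            if dna[pos:pos + 3] in STOP_CODONS:
                return start, pos + 3
        # No in-frame stop for this ATG: try the next ATG downstream.
        start = dna.find("ATG", start + 1)
    return None


# Example: two flanking bases, ATG, two codons, the stop codon TAG, two bases.
example = "CC" + "ATG" + "AAA" + "TTT" + "TAG" + "GG"
region = find_coding_region(example)
```

Note that the "TGA" overlapping the start codon in the example ("ATGA") is ignored because it is not in the reading frame defined by the ATG.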

The terms "polypeptide", "peptide", "oligopeptide", "gene product",
"expression product" and "protein" are used interchangeably herein to refer to a polymer
or oligomer of consecutive amino acid residues.

Preferably, the term "isolated" when used in relation to a nucleic acid, as in "an isolated
nucleic acid sequence" refers to a nucleic acid sequence that is identified and
separated from at least one contaminant nucleic acid with which it is ordinarily associated in
its natural source. Isolated nucleic acid is nucleic acid present in a form or setting that
is different from that in which it is found in nature. In contrast, non-isolated nucleic
acids are nucleic acids such as DNA and RNA which are found in the state they exist in
nature. For example, a given DNA sequence (e.g., a gene) is found on the host cell
chromosome in proximity to neighboring genes; RNA sequences, such as a specific
mRNA sequence encoding a specific protein, are found in the cell as a mixture with
numerous other mRNAs which encode a multitude of proteins. However, an isolated
nucleic acid sequence comprising SEQ ID NO: 1 includes, by way of example, such
nucleic acid sequences in cells which ordinarily contain SEQ ID NO: 1 where the
nucleic acid sequence is in a chromosomal or extrachromosomal location different from
that of natural cells, or is otherwise flanked by a different nucleic acid sequence than
that found in nature. The isolated nucleic acid sequence may be present in
single-stranded or double-stranded form. When an isolated nucleic acid sequence is to be
utilized to express a protein, the nucleic acid sequence will contain at a minimum at
least a portion of the sense or coding strand (i.e., the nucleic acid sequence may be
single-stranded). Alternatively, it may contain both the sense and anti-sense strands
(i.e., the nucleic acid sequence may be double-stranded).

As used herein, the term "purified" refers to molecules, either nucleic or amino acid
sequences, that are removed from their natural environment, isolated or separated. An
"isolated nucleic acid sequence" is therefore a purified nucleic acid sequence.
"Substantially purified" molecules are at least 60% free, preferably at least 75% free, and
more preferably at least 90% free from other components with which they are naturally
associated.

As used herein, the terms "complementary" or "complementarity" are used in reference
to nucleotide sequences related by the base-pairing rules. For example, the sequence
5'-AGT-3' is complementary to the sequence 5'-ACT-3'. Complementarity can be
"partial" or "total." "Partial" complementarity is where one or more nucleic acid bases is not
matched according to the base pairing rules. "Total" or "complete" complementarity
between nucleic acids is where each and every nucleic acid base is matched with
another base under the base pairing rules. The degree of complementarity between
nucleic acid strands has significant effects on the efficiency and strength of hybridization
between nucleic acid strands.

A "complement" of a nucleic acid sequence as used herein refers to a nucleotide
sequence whose nucleic acids show total complementarity to the nucleic acids of the
nucleic acid sequence.
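
The base-pairing rules underlying these definitions can be sketched in a few lines. The example assumes DNA sequences written 5' to 3' and restricted to the letters A, C, G and T; the helper names are hypothetical. It reproduces the pairing of 5'-AGT-3' with 5'-ACT-3' and distinguishes total from partial complementarity.

```python
# Watson-Crick pairing for DNA: A pairs with T, C pairs with G.
_PAIRS = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Complement of `seq`, written 5' to 3' (i.e. reversed after pairing)."""
    return seq.upper().translate(_PAIRS)[::-1]


def complementarity(a: str, b: str) -> float:
    """Fraction of positions at which `a` pairs with `b` when the two
    equal-length strands are aligned antiparallel (both given 5' to 3')."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    partner = reverse_complement(b)
    return sum(x == y for x, y in zip(a.upper(), partner)) / len(a)


def is_total_complement(a: str, b: str) -> bool:
    """True when each and every base of `a` is matched by a base of `b`."""
    return complementarity(a, b) == 1.0
```

A value of `complementarity` below 1.0 but above 0.0 corresponds to "partial" complementarity in the sense used above.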

The term "wild-type", "natural" or "of natural origin" means with respect to an organism,
polypeptide, or nucleic acid sequence, that said organism, polypeptide, or nucleic acid
sequence is naturally occurring or available in at least one naturally occurring organism
which is not changed, mutated, or otherwise manipulated by man.

The term "transgenic" or "recombinant" when used in reference to a cell refers to a cell
which contains a transgene, or whose genome has been altered by the introduction of
a transgene. The term "transgenic" when used in reference to a tissue or to a plant
refers to a tissue or plant, respectively, which comprises one or more cells that contain
a transgene, or whose genome has been altered by the introduction of a transgene.
Transgenic cells, tissues and plants may be produced by several methods including the
introduction of a "transgene" comprising nucleic acid (usually DNA) into a target cell or
integration of the transgene into a chromosome of a target cell by way of human
intervention, such as by the methods described herein.

The term "transgene" as used herein refers to any nucleic acid sequence which is
introduced into the genome of a cell by experimental manipulations. A transgene may be
an "endogenous DNA sequence," or a "heterologous DNA sequence" (i.e., "foreign
DNA"). The term "endogenous DNA sequence" refers to a nucleotide sequence which
is naturally found in the cell into which it is introduced so long as it does not contain
some modification (e.g., a point mutation, the presence of a selectable marker gene,
etc.) relative to the naturally-occurring sequence. The term "heterologous DNA
sequence" refers to a nucleotide sequence which is ligated to, or is manipulated to
become ligated to, a nucleic acid sequence to which it is not ligated in nature, or to which
it is ligated at a different location in nature. Heterologous DNA is not endogenous to the
cell into which it is introduced, but has been obtained from another cell. Heterologous
DNA also includes an endogenous DNA sequence which contains some modification.
Generally, although not necessarily, heterologous DNA encodes RNA and proteins that
are not normally produced by the cell in which it is expressed. Examples of
heterologous DNA include reporter genes, transcriptional and translational regulatory
sequences, selectable marker proteins (e.g., proteins which confer drug resistance), etc.
Preferably, the term "transgenic" or "recombinant" with respect to a regulatory
sequence (e.g., a promoter of the invention) means that said regulatory sequence is
covalently joined and adjacent to a nucleic acid to which it is not adjacent in its natural
environment.

The term "foreign gene" refers to any nucleic acid (e.g., gene sequence) which is intro- 
duced into the genome of a cell by experimental manipulations and may include gene 
sequences found in that cell so long as the introduced gene contains some modifica- 
tion (e.g., a point mutation, the presence of a selectable marker gene, etc.) relative to 
the naturally-occurring gene.

Preferably, the term "transgene" or "transgenic" with respect to, for example, a nucleic
acid sequence (or an organism, expression construct or vector comprising said nucleic
acid sequence) refers to all those constructs originating by experimental manipulations
in which either

a) said nucleic acid sequence, or

b) a genetic control sequence linked operably to said nucleic acid sequence (a), for
example a promoter, or

c) (a) and (b)

is not located in its natural genetic environment or has been modified by experimental
manipulations, an example of a modification being a substitution, addition, deletion,
inversion or insertion of one or more nucleotide residues. Natural genetic environment
refers to the natural chromosomal locus in the organism of origin, or to the presence in
a genomic library. In the case of a genomic library, the natural genetic environment of
the nucleic acid sequence is preferably retained, at least in part. The environment
flanks the nucleic acid sequence at least at one side and has a sequence of at least
50 bp, preferably at least 500 bp, especially preferably at least 1,000 bp, very
especially preferably at least 5,000 bp, in length. A naturally occurring expression construct
- for example the naturally occurring combination of a promoter with the corresponding
gene - becomes a transgenic expression construct when it is modified by non-natural,
synthetic "artificial" methods such as, for example, mutagenization. Such methods have
been described (US 5,565,350; WO 00/15815).

"Recombinant" polypeptides or proteins refer to polypeptides or proteins produced by
recombinant DNA techniques, i.e., produced from cells transformed by an exogenous
recombinant DNA construct encoding the desired polypeptide or protein. Recombinant
nucleic acids and polypeptides may also comprise molecules which as such do not
exist in nature but are modified, changed, mutated or otherwise manipulated by man.

The terms "heterologous nucleic acid sequence" or "heterologous DNA" are used
interchangeably to refer to a nucleotide sequence which is ligated to a nucleic acid
sequence to which it is not ligated in nature, or to which it is ligated at a different location
in nature. Heterologous DNA is not endogenous to the cell into which it is introduced,
but has been obtained from another cell. Generally, although not necessarily, such
heterologous DNA encodes RNA and proteins that are not normally produced by the
cell in which it is expressed.

The "efficiency of transformation" or "frequency of transformation" as used herein can 
be measured by the number of transformed cells (or transgenic organisms grown from 
individual transformed cells) that are recovered under standard experimental conditions 
(i.e., standardized or normalized with respect to amount of cells contacted with foreign
DNA, amount of delivered DNA, type and conditions of DNA delivery, general culture
conditions, etc.). For example, when isolated zygotes are used as starting material for
transformation, the frequency of transformation can be expressed as the number of 
transgenic plant lines obtained per 100 isolated zygotes transformed. 
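
The zygote-based measure above can be illustrated with a short calculation. The following is a minimal sketch only; the function name and the example counts are hypothetical and are not taken from the examples of this specification.

```python
def transformation_frequency(transgenic_lines: int, zygotes_transformed: int) -> float:
    """Return the number of transgenic plant lines obtained per 100 isolated zygotes.

    Both arguments are plain counts; the names are illustrative assumptions.
    """
    if zygotes_transformed <= 0:
        raise ValueError("zygotes_transformed must be a positive count")
    if transgenic_lines < 0:
        raise ValueError("transgenic_lines must not be negative")
    return 100.0 * transgenic_lines / zygotes_transformed


# Hypothetical experiment: 12 transgenic lines recovered from 400 zygotes.
print(transformation_frequency(12, 400))
```

Frequencies obtained in this way are only comparable between experiments when the standard conditions listed above (amount of cells, amount of DNA, delivery conditions, culture conditions) are held constant.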

The term "cell" refers to a single cell. The term "cells" refers to a population of cells. 
The population may be a pure population comprising one cell type. Likewise, the popu- 
lation may comprise more than one cell type. In the present invention, there is no limit 
on the number of cell types that a cell population may comprise. The cells may be
synchronized or not synchronized; preferably the cells are synchronized.

The term "plant" as used herein refers to a plurality of plant cells which are largely
differentiated into a structure that is present at any stage of a plant's development. Such
structures include one or more plant organs including, but not limited to, fruit, shoot,
stem, leaf, flower petal, etc.

The term "organ" with respect to a plant (or "plant organ") means parts of a plant and
may include (but shall not be limited to) for example roots, fruits, shoots, stem, leaves,
anthers, sepals, petals, pollen, seeds, etc.

The term "tissue" with respect to a plant (or "plant tissue") means an arrangement of
multiple plant cells including differentiated and undifferentiated tissues of plants. Plant
tissues may constitute part of a plant organ (e.g., the epidermis of a plant leaf) but may
also constitute tumor tissues and various types of cells in culture (e.g., single cells,
protoplasts, embryos, calli, protocorm-like bodies, etc.). Plant tissue may be in planta,
in organ culture, tissue culture, or cell culture.

The term "chromosomal DNA" or "chromosomal DNA-sequence" is to be understood 
as the genomic DNA of the cellular nucleus independent from the cell cycle status. 
Chromosomal DNA might therefore be organized in chromosomes or chromatids, they 
might be condensed or uncoiled. An insertion into the chromosomal DNA can be dem- 
onstrated and analyzed by various methods known in the art like e.g., polymerase 
chain reaction (PCR) analysis, Southern blot analysis, fluorescence in situ hybridization 
(FISH), and in situ PCR. 

The term "structural gene" as used herein is intended to mean a DNA sequence that is 
transcribed into mRNA which is then translated into a sequence of amino acids charac- 
teristic of a specific polypeptide. 



The term "nucleotide sequence of interest" refers to any nucleotide sequence, the
manipulation of which may be deemed desirable for any reason (e.g., confer improved
qualities), by one of ordinary skill in the art. Such nucleotide sequences include, but are
not limited to, coding sequences of structural genes (e.g., reporter genes, selection
marker genes, oncogenes, drug resistance genes, growth factors, etc.), and non-coding
regulatory sequences which do not encode an mRNA or protein product (e.g.,
promoter sequence, polyadenylation sequence, termination sequence, enhancer
sequence, etc.).

The term "expression" refers to the biosynthesis of a gene product. For example, in the
case of a structural gene, expression involves transcription of the structural gene into
mRNA and - optionally - the subsequent translation of mRNA into one or more
polypeptides.

The term "transformation" as used herein refers to the introduction of genetic material
(e.g., a transgene) into a cell. Transformation of a cell may be stable or transient. The
term "transient transformation" or "transiently transformed" refers to the introduction of
one or more transgenes into a cell in the absence of integration of the transgene into
the host cell's genome. Transient transformation may be detected by, for example,
enzyme-linked immunosorbent assay (ELISA) which detects the presence of a
polypeptide encoded by one or more of the transgenes. Alternatively, transient transformation
may be detected by detecting the activity of the protein (e.g., β-glucuronidase) encoded
by the transgene (e.g., the uidA gene) as demonstrated herein [e.g., histochemical
assay of GUS enzyme activity by staining with X-gluc which gives a blue precipitate in the
presence of the GUS enzyme; and a chemiluminescent assay of GUS enzyme activity
using the GUS-Light kit (Tropix)]. The term "transient transformant" refers to a cell
which has transiently incorporated one or more transgenes. In contrast, the term
"stable transformation" or "stably transformed" refers to the introduction and integration of
one or more transgenes into the genome of a cell, preferably resulting in chromosomal
integration and stable heritability through meiosis. Stable transformation of a cell may
be detected by Southern blot hybridization of genomic DNA of the cell with nucleic acid
sequences which are capable of binding to one or more of the transgenes.
Alternatively, stable transformation of a cell may also be detected by the polymerase chain
reaction of genomic DNA of the cell to amplify transgene sequences. The term "stable
transformant" refers to a cell which has stably integrated one or more transgenes into
the genomic DNA. Thus, a stable transformant is distinguished from a transient
transformant in that, whereas genomic DNA from the stable transformant contains one or
more transgenes, genomic DNA from the transient transformant does not contain a
transgene. Transformation also includes introduction of genetic material into plant cells
in the form of plant viral vectors involving epichromosomal replication and gene
expression which may exhibit variable properties with respect to meiotic stability.

The terms "infecting" and "infection" with a bacterium refer to co-incubation of a target
biological sample (e.g., cell, tissue, etc.) with the bacterium under conditions such that
nucleic acid sequences contained within the bacterium are introduced into one or more
cells of the target biological sample.

The term "Agrobacterium" refers to a soil-borne, Gram-negative, rod-shaped
phytopathogenic bacterium which causes crown gall. The term "Agrobacterium" includes, but is
not limited to, the strains Agrobacterium tumefaciens (which typically causes crown
gall in infected plants), and Agrobacterium rhizogenes (which causes hairy root disease
in infected host plants). Infection of a plant cell with Agrobacterium generally results in
the production of opines (e.g., nopaline, agropine, octopine, etc.) by the infected cell.
Thus, Agrobacterium strains which cause production of nopaline (e.g., strain LBA4301,
C58, A208) are referred to as "nopaline-type" Agrobacteria; Agrobacterium strains
which cause production of octopine (e.g., strain LBA4404, Ach5, B6) are referred to as
"octopine-type" Agrobacteria; and Agrobacterium strains which cause production of
agropine (e.g., strain EHA105, EHA101, A281) are referred to as "agropine-type"
Agrobacteria.

The terms "bombarding," "bombardment," and "biolistic bombardment" refer to the
process of accelerating particles towards a target biological sample (e.g., cell, tissue,
etc.) to effect wounding of the cell membrane of a cell in the target biological sample
and/or entry of the particles into the target biological sample. Methods for biolistic
bombardment are known in the art (e.g., US 5,584,807, the contents of which are herein
incorporated by reference), and are commercially available (e.g., the helium gas-driven
microprojectile accelerator PDS-1000/He (BioRad)).

The term "microwounding" when made in reference to plant tissue refers to the intro- 
duction of microscopic wounds in that tissue. Microwounding may be achieved by, for 
example, particle bombardment as described herein. 

The term "expression construct" as used herein is intended to
mean the combination of any nucleic acid sequence to be expressed in operable
linkage with a promoter sequence and - optionally - additional elements (e.g.,
terminator and/or polyadenylation sequences) which facilitate expression of said nucleic
acid sequence.

The term "promoter," "promoter element," or "promoter sequence" as used herein,
refers to a DNA sequence which when ligated to a nucleotide sequence of interest is
capable of controlling the transcription of the nucleotide sequence of interest into mRNA.
A promoter is typically, though not necessarily, located 5' (i.e., upstream) of a
nucleotide sequence of interest (e.g., proximal to the transcriptional start site of a structural
gene) whose transcription into mRNA it controls, and provides a site for specific binding
by RNA polymerase and other transcription factors for initiation of transcription.

Promoters may be tissue specific or cell specific. The term "tissue specific" as it applies 
to a promoter refers to a promoter that is capable of directing selective expression of a 
nucleotide sequence of interest to a specific type of tissue (e.g., petals) in the relative 
absence of expression of the same nucleotide sequence of interest in a different type 
of tissue (e.g., roots). Tissue specificity of a promoter may be evaluated by, for exam- 
ple, operably linking a reporter gene to the promoter sequence to generate a reporter 
construct, introducing the reporter construct into the genome of a plant such that the 
reporter construct is integrated into every tissue of the resulting transgenic plant, and 
detecting the expression of the reporter gene (e.g., detecting mRNA, protein, or the 
activity of a protein encoded by the reporter gene) in different tissues of the transgenic 
plant. The detection of a greater level of expression of the reporter gene in one or more 
tissues relative to the level of expression of the reporter gene in other tissues shows 
that the promoter is specific for the tissues in which greater levels of expression are 
detected. The term "cell type specific" as applied to a promoter refers to a promoter
which is capable of directing selective expression of a nucleotide sequence of interest
in a specific type of cell in the relative absence of expression of the same nucleotide
sequence of interest in a different type of cell within the same tissue. The term "cell
type specific" when applied to a promoter also means a promoter capable of promoting
selective expression of a nucleotide sequence of interest in a region within a single
tissue. Cell type specificity of a promoter may be assessed using methods well known
in the art, e.g., GUS activity staining (as described for example in Example 7) or
immunohistochemical staining. Briefly, tissue sections are embedded in paraffin, and paraffin
sections are reacted with a primary antibody which is specific for the polypeptide
product encoded by the nucleotide sequence of interest whose expression is controlled by
the promoter. A labeled (e.g., peroxidase conjugated) secondary antibody which is
specific for the primary antibody is allowed to bind to the sectioned tissue and specific
binding detected (e.g., with avidin/biotin) by microscopy.
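
The comparison of reporter gene expression between tissues described above can be sketched as follows. This is an illustrative assumption only: the function names, the cutoff of one half of the highest tissue value, and the example readings are hypothetical and do not reproduce data of this specification.

```python
def relative_expression(levels: dict[str, float]) -> dict[str, float]:
    """Scale reporter readings so that the highest-expressing tissue equals 1.0."""
    if not levels:
        raise ValueError("at least one tissue reading is required")
    peak = max(levels.values())
    if peak <= 0:
        raise ValueError("no reporter expression detected in any tissue")
    return {tissue: value / peak for tissue, value in levels.items()}


def high_expression_tissues(levels: dict[str, float], cutoff: float = 0.5) -> list[str]:
    """Return tissues whose relative reporter level reaches the (assumed) cutoff."""
    scaled = relative_expression(levels)
    return sorted(tissue for tissue, ratio in scaled.items() if ratio >= cutoff)


# Hypothetical reporter activities (arbitrary units) from one transgenic plant.
readings = {"leaf": 120.0, "stem": 100.0, "root": 110.0, "seed": 2.0, "flower": 8.0}
print(high_expression_tissues(readings))
```

The cutoff is a free parameter of the sketch; the definition above only requires a greater level of expression in some tissues relative to others and does not fix a numerical threshold.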

Promoters may be constitutive or regulatable. The term "constitutive" when made in
reference to a promoter means that the promoter is capable of directing transcription of
an operably linked nucleic acid sequence in the absence of a stimulus (e.g., heat
shock, chemicals, light, etc.). Typically, constitutive promoters are capable of directing
expression of a transgene in substantially any cell and any tissue. In contrast, a
"regulatable" promoter is one which is capable of directing a level of transcription of an
operably linked nucleic acid sequence in the presence of a stimulus (e.g., heat shock,
chemicals, light, etc.) which is different from the level of transcription of the operably
linked nucleic acid sequence in the absence of the stimulus.

The term "operable linkage" or "operably linked" is to be understood as meaning, for
example, the sequential arrangement of a regulatory element (e.g., a promoter) with a
nucleic acid sequence to be expressed and, if appropriate, further regulatory elements
(such as e.g., a terminator) in such a way that each of the regulatory elements can
fulfill its intended function to allow, modify, facilitate or otherwise influence expression of
said nucleic acid sequence. Depending on the arrangement of the nucleic acid
sequences, the expression may result in sense or antisense RNA. To this end, direct
linkage in the chemical sense is not necessarily required. Genetic control sequences
such as, for example, enhancer sequences, can also exert their function on the target
sequence from positions which are further away, or indeed from other DNA molecules.
Preferred arrangements are those in which the nucleic acid sequence to be expressed
recombinantly is positioned behind the sequence acting as promoter, so that the two
sequences are linked covalently to each other. The distance between the promoter
sequence and the nucleic acid sequence to be expressed recombinantly is preferably
less than 200 base pairs, especially preferably less than 100 base pairs, very
especially preferably less than 50 base pairs. Operable linkage, and an expression
construct, can be generated by means of customary recombination and cloning techniques
as described (e.g., in Maniatis 1989; Silhavy 1984; Ausubel 1987; Gelvin 1990).
However, further sequences which, for example, act as a linker with specific cleavage sites
for restriction enzymes, or as a signal peptide, may also be positioned between the two
sequences. The insertion of sequences may also lead to the expression of fusion
proteins. Preferably, the expression construct, consisting of a linkage of promoter and
nucleic acid sequence to be expressed, can exist in a vector-integrated form and be
inserted into a plant genome, for example by transformation.
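
The preferred distances between the promoter sequence and the nucleic acid sequence to be expressed can be expressed as a simple lookup. The following is a minimal sketch; the function name and the returned labels are hypothetical, and only the three thresholds (200, 100 and 50 base pairs) are taken from the text above.

```python
def linkage_distance_preference(distance_bp: int) -> str:
    """Classify a promoter-to-sequence distance against the stated preference tiers."""
    if distance_bp < 0:
        raise ValueError("distance_bp must not be negative")
    if distance_bp < 50:
        return "very especially preferred"
    if distance_bp < 100:
        return "especially preferred"
    if distance_bp < 200:
        return "preferred"
    # Larger distances are not excluded by the definition; they are merely
    # outside the ranges named as preferred.
    return "outside the stated preferred ranges"


for distance in (30, 75, 150, 250):
    print(distance, linkage_distance_preference(distance))
```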

The terms "homology" or "identity" when used in relation to nucleic acids refer to a
degree of complementarity. Homology or identity between two nucleic acids is
understood as meaning the identity of the nucleic acid sequence over in each case the entire
length of the sequence, which is calculated by comparison with the aid of the program
algorithm GAP (Wisconsin Package Version 10.0, University of Wisconsin, Genetics
Computer Group (GCG), Madison, USA) with the parameters being set as follows:

Gap Weight: 12          Length Weight: 4

Average Match: 2.912    Average Mismatch: -2.003

For example, a sequence with at least 95% homology (or identity) to the sequence
SEQ ID NO. 1 at the nucleic acid level is understood as meaning the sequence which,
upon comparison with the sequence SEQ ID NO. 1 by the above program algorithm
with the above parameter set, has at least 95% homology. There may be partial
homology (i.e., partial identity of less than 100%) or complete homology (i.e., complete
identity of 100%).
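
How a percentage identity follows from an alignment can be illustrated as below. This sketch does not reproduce the GAP program or its scoring parameters; it assumes that two sequences have already been aligned (gaps written as "-") and simply counts identical positions over the full alignment length. The function name and the example sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Return identical positions as a percentage of the alignment length.

    Assumes both strings come from the same pairwise alignment, so they have
    equal length and use '-' for gap positions. Gap columns count as mismatches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment must not be empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


# Hypothetical 10-column alignment with a single mismatch.
print(percent_identity("ACGTACGTAC", "ACGTTCGTAC"))
```

Whether gap columns are included in the denominator is a convention; the choice made here is an assumption of the sketch, and a dedicated alignment program should be used for the comparison defined above.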

Alternatively, a partially complementary sequence is understood to be one that at least
partially inhibits a completely complementary sequence from hybridizing to a target
nucleic acid and is referred to using the functional term "substantially homologous."
The inhibition of hybridization of the completely complementary sequence to the target
sequence may be examined using a hybridization assay (Southern or Northern blot,
solution hybridization and the like) under conditions of low stringency. A substantially
homologous sequence or probe (i.e., an oligonucleotide which is capable of hybridizing
to another oligonucleotide of interest) will compete for and inhibit the binding (i.e., the
hybridization) of a completely homologous sequence to a target under conditions of low
stringency. This is not to say that conditions of low stringency are such that
non-specific binding is permitted; low stringency conditions require that the binding of two
sequences to one another be a specific (i.e., selective) interaction. The absence of
non-specific binding may be tested by the use of a second target which lacks even a
partial degree of complementarity (e.g., less than about 30% identity); in the absence
of non-specific binding the probe will not hybridize to the second non-complementary
target.

When used in reference to a double-stranded nucleic acid sequence such as a cDNA
or genomic clone, the term "substantially homologous" refers to any probe which can
hybridize to either or both strands of the double-stranded nucleic acid sequence under
conditions of low stringency as described infra.

When used in reference to a single-stranded nucleic acid sequence, the term
"substantially homologous" refers to any probe which can hybridize to the single-stranded
nucleic acid sequence under conditions of low stringency as described infra.

The term "hybridization" as used herein includes "any process by which a strand of
nucleic acid joins with a complementary strand through base pairing." (Coombs 1994).
Hybridization and the strength of hybridization (i.e., the strength of the association
between the nucleic acids) is impacted by such factors as the degree of complementarity
between the nucleic acids, stringency of the conditions involved, the Tm of the formed
hybrid, and the G:C ratio within the nucleic acids.

As used herein, the term "Tm" is used in reference to the "melting temperature." The
melting temperature is the temperature at which a population of double-stranded
nucleic acid molecules becomes half dissociated into single strands. The equation for
calculating the Tm of nucleic acids is well known in the art. As indicated by standard
references, a simple estimate of the Tm value may be calculated by the equation:
Tm = 81.5 + 0.41 (% G+C), when a nucleic acid is in aqueous solution at 1 M NaCl [see
e.g., Anderson and Young, Quantitative Filter Hybridization, in Nucleic Acid
Hybridization (1985)]. Other references include more sophisticated computations which take
structural as well as sequence characteristics into account for the calculation of Tm.
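
The simple estimate Tm = 81.5 + 0.41 (% G+C) can be evaluated directly from a sequence. The following is a minimal sketch of that estimate only; the function names and the example sequence are hypothetical, and the result applies under the stated condition (aqueous solution at 1 M NaCl), not to the more sophisticated computations mentioned above.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleic acid sequence."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def estimate_tm(sequence: str) -> float:
    """Estimate Tm in degrees Celsius as 81.5 + 0.41 * (% G+C)."""
    return 81.5 + 0.41 * gc_percent(sequence)


# Hypothetical sequence with 50% G+C content.
print(round(estimate_tm("ATGCATGCATGC"), 1))
```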

Low stringency conditions when used in reference to nucleic acid hybridization
comprise conditions equivalent to binding or hybridization at 68°C in a solution consisting
of 5x SSPE (43.8 g/L NaCl, 6.9 g/L NaH2PO4·H2O and 1.85 g/L EDTA, pH adjusted to
7.4 with NaOH), 1% SDS, 5x Denhardt's reagent [50x Denhardt's contains the following
per 500 mL: 5 g Ficoll (Type 400, Pharmacia), 5 g BSA (Fraction V; Sigma)] and 100
µg/mL denatured salmon sperm DNA followed by washing in a solution comprising
0.2x SSPE, and 0.1% SDS at room temperature when a DNA probe of about 100 to
about 1000 nucleotides in length is employed. High stringency conditions when used in
reference to nucleic acid hybridization comprise conditions equivalent to binding or
hybridization at 68°C in a solution consisting of 5x SSPE, 1% SDS, 5x Denhardt's
reagent and 100 µg/mL denatured salmon sperm DNA followed by washing in a
solution comprising 0.1x SSPE, and 0.1% SDS at 68°C when a probe of about 100 to
about 1000 nucleotides in length is employed.

The term "equivalent" when made in reference to a hybridization condition as it relates
to a hybridization condition of interest means that the hybridization condition and the
hybridization condition of interest result in hybridization of nucleic acid sequences
which have the same range of percent (%) homology. For example, if a hybridization
condition of interest results in hybridization of a first nucleic acid sequence with other
nucleic acid sequences that have from 80% to 90% homology to the first nucleic acid
sequence, then another hybridization condition is said to be equivalent to the
hybridization condition of interest if this other hybridization condition also results in hybridization
of the first nucleic acid sequence with the other nucleic acid sequences that have from
80% to 90% homology to the first nucleic acid sequence.

When used in reference to nucleic acid hybridization the art knows well that numerous
equivalent conditions may be employed to comprise either low or high stringency
conditions; factors such as the length and nature (DNA, RNA, base composition) of the
probe and nature of the target (DNA, RNA, base composition, present in solution or
immobilized, etc.) and the concentration of the salts and other components (e.g., the
presence or absence of formamide, dextran sulfate, polyethylene glycol) are
considered and the hybridization solution may be varied to generate conditions of either low
or high stringency hybridization different from, but equivalent to, the above-listed
conditions. Those skilled in the art know that whereas higher stringencies may be preferred
to reduce or eliminate non-specific binding between the nucleotide sequence of SEQ
ID NOs:1 or 2 and other nucleic acid sequences, lower stringencies may be preferred
to detect a larger number of nucleic acid sequences having different homologies to the
nucleotide sequence of SEQ ID NOs:1 and 2.

DETAILED DESCRIPTION OF THE INVENTION

A first subject matter of the invention therefore relates to a transgenic expression
construct for predominant expression of a nucleic acid sequence of interest in
substantially all vegetative plant tissues comprising a promoter sequence selected from the
group consisting of

a) the promoter of the Pisum sativum ptxA gene, functional equivalent fragments and
functional equivalent homologs thereof, or their complements, having essentially the
same promoter activity as the promoter of the Pisum sativum ptxA gene, and

b) the promoter of the Glycine max extensin (SbHRGP3) gene, functional equivalent
fragments and functional equivalent homologs thereof, or their complements, having
essentially the same promoter activity as the promoter of the Glycine max extensin
(SbHRGP3) gene,

wherein said promoter sequence is operably linked to a nucleic acid sequence of
interest to be transgenically expressed, and wherein said promoter sequence is
heterologous with respect to said nucleic acid sequence of interest.

The sequence of the ptxA gene from Pisum sativum is disclosed (GenBank Acc.-No.:
X67427). However, the promoter has so far not been isolated and combined with other
(heterologous) sequences to realize transgenic expression. The promoter region of the
ptxA gene has approximately 50% nucleotide sequence identity to the promoter region
of the Medicago sativa proline-rich protein (MsPRP2) gene. In some regions the sequence
identity rises even higher, up to 87% over 100 consecutive base pairs. On the protein
level the similarity between the ptxA protein and the MsPRP protein is also very high
(see Fig. 6 a and b). The ptxA gene encodes a protein of 352 amino acids. The
sequences include 60-80% amino acid identity in 83 residues to the proline-rich proteins
from Medicago truncatula, Lycopersicon esculentum, Solanum brevidens, Vitis vinifera
or Zea mays. In the same residues, the sequence alignment shows approximately 50%
amino acid identity to the probable cell wall-plasma membrane linker protein (PRP)
from Arabidopsis thaliana, 48% identity to the protease inhibitor/seed storage/lipid
transfer protein (LTP) family from Arabidopsis thaliana, 95% identity to salt-inducible
protein RF2 from Medicago sativa, and 53% identity to root-specific protein RCc3 from
Oryza sativa. However, the MsPRP2 promoter is described to be both root-specific and
salt-inducible (Bastola 1998; WO 99/53016). Both reported expression characteristics
for the MsPRP2 promoter are significantly different from the expression patterns that
were observed for the ptxA promoter of this invention. Its vegetative plant tissue/organ
specific, stress-independent expression profile is surprising and unexpected with
respect to the MsPRP2 promoter expression profile.

Ahn et al. (1998) reported that the expression of the soybean hydroxyproline-rich
glycoprotein (SbHRGP3) gene is required for root maturation to terminate root elongation. The
SbHRGP3 gene encodes a protein of 432 amino acids representing a
hydroxyproline-rich glycoprotein with 50-80% amino acid sequence identity to HRGP or extensin
protein from Phaseolus vulgaris, Pisum sativum, Solanum tuberosum. A combination of
wounding and sucrose enhanced expression of this gene in roots. In leaves, both
wounding and sucrose were required for the expression of SbHRGP3. The sequence of
the SbHRGP3 gene is disclosed (GenBank Acc.-No.: U44838). However, the promoter
has so far not been combined with other (heterologous) sequences to realize transgenic
expression. It is surprising that a heterologous combination totally changes the
expression profile of the SbHRGP3 promoter described in the art. Both the root-specific
and the stress-inducible expression pattern described for the native gene cannot be
observed in the transgenic expression construct of this invention. This is surprising.
The change in the expression profile may be explained by the absence of regulatory
elements (mediating the tissue specificity and stress responsiveness of the native
gene) not present in the promoter region (but e.g., in introns of the native gene) and/or
by the absence of regulatory proteins in the heterologous plant species.

The promoter sequences of the ptxA or SbHRGP3 gene demonstrate a highly uniform,
homogenous expression activity in virtually all vegetative plant tissues of various
species including dicotyledonous and monocotyledonous plants. In seeds and flowers,
there is no expression activity detectable by GUS staining (see Example 7, and Fig. 3,
4, and 5) and low expression activity detectable with the more sensitive method of
RT-PCR (data not shown). Only in plant lines comprising multiple copies of a transgenic
ptxA-promoter / GUS expression construct some expression can be detected in part of
the flowers and the siliques (seedpods). It is an advantage that no or very little
transgenic protein will be expressed in the seed (which is used for food and feed purpose)
and flowers (which is preferred from an environmental point of view). For numerous
agronomically valuable traits (e.g., stress resistance, improved water use, resistance
against fungi or insects, etc.) no expression in seeds and flowers is required.
Therefore, avoidance of this unnecessary expression may facilitate regulatory approval
and/or consumer acceptance.

Furthermore, data from the β-glucuronidase (GUS) expression assay suggest that the
promoter activity in the vegetative plant tissues and organs at the vegetative stages is
relatively stronger than at the reproductive stages. In consequence the promoter
is most active in the young vulnerable plantlet, but becomes less active in the mature
plant. This is an additional advantage, especially for genes which confer resistance
against biotic or abiotic stress factors (e.g., cold, drought, insect damage, etc.) since
young, developing plants are considered much more vulnerable to said stress
factors than mature plants. The promoter activity of the promoters of the invention is
especially high in non-differentiated or de-differentiated tissue or cells like, e.g., callus
culture. This is very useful for utilizing the promoter in combination with a selection
marker in transformation protocols.

The invention furthermore relates to a method for transgenic predominant expression
of a nucleic acid sequence of interest in substantially all vegetative plant tissues,
comprising:

i. introduction of a transgenic expression construct into a plant cell or a plant, said
transgenic expression construct comprising a promoter sequence selected from the
group consisting of

a) the promoter of the Pisum sativum ptxA gene, functional equivalent fragments
and functional equivalent homologs thereof, or their complements, having
essentially the same promoter activity as the promoter of the Pisum sativum ptxA gene,
and




b) the promoter of the Glycine max extensin (SbHRGP3) gene, functional equivalent
fragments and functional equivalent homologs thereof, or their complements,
having essentially the same promoter activity as the promoter of the Glycine max
extensin (SbHRGP3) gene,

wherein said promoter sequence is operably linked to a nucleic acid sequence of
interest to be transgenically expressed, and wherein said promoter sequence is
heterologous with respect to said nucleic acid sequence of interest,
under conditions such that said nucleic acid sequence of interest is expressed in
said plant cell and/or predominantly expressed in the vegetative plant tissues and/or
organs of said transgenic plant.

In a preferred embodiment, the method further comprises ii) identifying or selecting the
transgenic plant cell comprising said transgenic expression construct. In another
preferred embodiment, the method further comprises iii) regenerating transgenic plant
tissue from the transgenic plant cell. In an alternative preferred embodiment, the
method further comprises iv) regenerating a transgenic plant from the transgenic plant cell.

Preferably, the promoter sequence utilized in the inventive transgenic expression
constructs or methods of the invention is selected from the group of sequences consisting
of:

a) the promoter of the Pisum sativum ptxA gene as described by SEQ ID NO: 1 or its
complement,

b) a functional equivalent fragment of at least 50 consecutive base pairs of the
promoter sequence described by SEQ ID NO: 1, or its complement, having essentially
the same promoter activity as the promoter sequence described by SEQ ID NO: 1,

c) a functional equivalent homolog of the promoter sequence described by SEQ ID
NO: 1 which has essentially the same promoter activity as the promoter sequence
described by SEQ ID NO: 1, and

i) has a homology of at least 95% over a sequence of at least 100 consecutive base
pairs to the sequence as described by SEQ ID NO: 1 and/or

ii) hybridizes under high stringency conditions with a fragment of at least 50
consecutive base pairs of the nucleic acid molecule described by SEQ ID NO: 1.

A preferred functional equivalent fragment of the ptxA promoter comprises a sequence
from about base pair 300 to about base pair 583 of the sequence described by SEQ ID
NO: 1.

In another preferred embodiment, the promoter sequence utilized in the inventive
transgenic expression constructs or methods of the invention is selected from the
group of sequences consisting of:

a) the promoter of the Glycine max extensin (SbHRGP3) gene as described by SEQ ID
NO: 2, or its complement,

b) a functional equivalent fragment of at least 50 consecutive base pairs of the
promoter sequence described by SEQ ID NO: 2, or its complement, having essentially
the same promoter activity as the promoter sequence described by SEQ ID NO: 2,

c) a functional equivalent homolog of the promoter sequence described by SEQ ID
NO: 2 which has essentially the same promoter activity as the promoter sequence
described by SEQ ID NO: 2, and

i) has a homology of at least 60% over a sequence of at least 100 consecutive base
pairs to the sequence as described by SEQ ID NO: 2 and/or

ii) hybridizes under high stringency conditions with a fragment of at least 50
consecutive base pairs of the nucleic acid molecule described by SEQ ID NO: 2.

A preferred functional equivalent fragment of the SbHRGP3 promoter comprises a
sequence from about base pair 800 to about base pair 1179 of the sequence described
by SEQ ID NO: 2.

Functional equivalent homologs of the SbHRGP3 promoter are for example described
by SEQ ID NO: 7, 8, and 9. While the homologs described by SEQ ID NO: 8 and 9
only differ in the 5'- and 3'-end of the promoter region, the homolog described by SEQ
ID NO: 7 also comprises internal deletions, additions and mutations and was derived
from a different Glycine max line. A total of 35 nt is different between SEQ ID NO: 7
and 9. Thus the identity (homology) between the two sequences is 97.5%.
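
The identity figure quoted above follows from simple arithmetic. The following is a minimal sketch only: the function names are illustrative, and the aligned length of about 1400 positions is inferred from the two stated numbers (35 differing nucleotides, 97.5%) rather than given in the text.

```python
def percent_identity(differences: int, aligned_length: int) -> float:
    """Percent identity when `differences` of `aligned_length` aligned positions differ."""
    if aligned_length <= 0 or not 0 <= differences <= aligned_length:
        raise ValueError("need aligned_length > 0 and 0 <= differences <= aligned_length")
    return 100.0 * (aligned_length - differences) / aligned_length


def implied_aligned_length(differences: int, identity_percent: float) -> float:
    """Aligned length implied by a difference count and a stated percent identity."""
    return differences / (1.0 - identity_percent / 100.0)


# Values taken from the text: 35 differing nucleotides, 97.5% identity.
# The resulting length is an inference, not a figure from the sequence listing.
length = round(implied_aligned_length(35, 97.5))
print(length, percent_identity(35, length))
```

Under these assumptions, 35 differences over roughly 1400 compared positions are consistent with the stated identity.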

The transgenic expression construct of the invention may comprise further genetic
control sequences linked operably to the nucleic acid sequence of interest to be expressed,
and/or additional functional elements.

The nucleic acid sequence of interest transgenically expressed from the transgenic
expression construct of the invention may result in expression of a protein encoded by
said nucleic acid sequence (by transcription and subsequent translation), and/or
expression of sense, antisense or double-stranded RNA encoded by said nucleic acid
sequence of interest.

In another embodiment, the nucleotide sequence encoding the transgenic expression
construct of the invention is double-stranded. In yet another embodiment, the nucleotide
sequence encoding the transgenic expression construct of the invention is
single-stranded.

In yet another alternative embodiment, the transgenic expression construct of the
invention is contained in a vector or in a non-human organism, preferably a plant cell or a
plant. In a preferred embodiment, the plant cell is derived from a dicotyledonous or
monocotyledonous plant. In a yet more preferred embodiment, the monocotyledonous
plant is selected from the group consisting of sugarcane, maize, sorghum, pineapple,
rice, barley, oat, wheat, rye, yam, onion, banana, coconut, date, and hop. In a yet more
preferred embodiment, the dicotyledonous plant is selected from the group consisting
of rapeseed, tobacco, tomato, tagetes (marigold), soybean, pea, common bean, and
papaya.

Further embodiments of the invention relate to the use of a transgenic organism of the
invention or of cell cultures, parts or transgenic propagation material derived therefrom
for the production of foodstuffs, animal feeds, seed, pharmaceuticals or fine chemicals.
Another embodiment of the invention relates to a method for production of a foodstuff,
animal feed, seed, pharmaceutical or fine chemical employing a transgenic organism of
the invention or of cell cultures, parts or transgenic propagation material derived
therefrom.

Besides the promoter sequences as described by SEQ ID NO: 1 or 2, additional
sequences are subject of the present invention. Another embodiment of the invention
relates to the complements of the sequences as described by SEQ ID NO: 1 or 2. It is known in
the art that promoter sequences often have not only a unidirectional transcription
activity but a bi-directional one, so that the complementary sequence also constitutes a
promoter sequence (Schmulling 1989; Feltkamp 1994; Sadanandom 1996).

Further embodiments of the invention relate to functional equivalent homologs of the
ptxA or the SbHRGP3 promoter, preferably functional equivalent homologs of the
promoter sequences as described by SEQ ID NO: 1 or 2. A "functional equivalent
homolog" of SEQ ID NOs: 1 or 2 is defined as a nucleotide sequence having less than
100% homology with SEQ ID NOs: 1 or 2, respectively, and which has promoter
activity having the essential characteristics (vegetative plant tissue specific expression) of
the promoter activity of SEQ ID NOs: 1 or 2, respectively. Functional equivalent
homologs of SEQ ID NOs: 1 or 2, and of functional equivalent fragments (portions)
thereof, include, but are not limited to, nucleotide sequences having deletions,
insertions or substitutions of different nucleotides or nucleotide analogs as compared to
SEQ ID NOs: 1 or 2, respectively. Functional equivalent homologs also encompass all
those sequences which are derived from the complementary counter-strand of the
sequence defined by SEQ ID NO: 1 or 2 and have essentially the same promoter
activity.

Functional equivalent homologs with regard to the ptxA or SbHRGP3 promoter means,
in particular, natural or artificial mutations of the ptxA or SbHRGP3 promoter sequence
described in SEQ ID NO: 1 or 2, or of the deletion variants derived therefrom, or its
homologs from other plant genera and plant species, which continue to exhibit essentially
the same promoter activity.

A promoter activity - with respect to the ptxA or SbHRGP3 promoter - is termed
essentially the same when the transcription of any nucleic acid sequence expressed under
the control of a specific promoter in a plant takes place predominantly in substantially
all vegetative plant tissues and/or organs but is comparatively low or non-existent in
seeds and flowers.

The term "vegetative plant tissue" or "vegetative organs" as used herein is intended to
comprise all organs and tissues of a plant besides seeds and flowers (the reproductive
organs leading to development of seeds).

The term "seed" as used herein means seeds in all developmental stages, preferably a
mature seed. A mature seed is understood to comprise seeds which have reached
physiological maturity, in all stages from the late stages of seed development to
dried seed after harvest. Preferably the term seed means seed in the condition where it
is normally stored and marketed for feed and food purposes. Such seed may be
characterized by its water content. Depending on the target plant, the water content in the
whole seeds can range from about 5% (w/w) (for e.g. dried Arabidopsis seeds) to about
30% (for e.g. whole maize seeds on a fresh weight basis) (Villela 1998).

The term "flower" as used herein means the reproductive organ of a flowering
(angiosperm) plant, being that part of a plant destined to produce seed, and hence including
one or both of the sexual organs; an organ or combination of the organs of
reproduction, whether enclosed by a circle of foliar parts or not. A complete flower consists of
two essential parts, the stamens and the carpels, and two floral envelopes, the corolla
and calyx. In mosses the flowers consist of a few special leaves surrounding or
subtending organs called archegonia.

The term "substantially all vegetative plant tissues or organs" means that the
accumulated biomass of organs (or tissues), for which an expression under control of a
promoter of the invention can be detected, adds up to more than 50%, preferably more
than 80%, more preferably more than 90% of the total biomass of the vegetative
organs (or tissues) (which is the total biomass of the plantlet or plant minus the biomass
of the seed and flowers). Scenarios are possible where one or more vegetative organs (or
tissues) do not demonstrate detectable expression. Preferably, expression in the
vegetative organs occurs at least in stems, leaves, or roots and in undifferentiated cells
(like, e.g., callus). In a preferred embodiment the term "substantially all vegetative
plant tissues or organs" means a promoter which has no detectable expression (as for
example judged by employing a promoter/GUS expression cassette) in seed tissue but
has detectable expression in at least one tissue selected from the group of leaves, stem,
and roots. More preferably said promoter has expression in leaves, stem and roots (but
not in seeds).
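
The biomass criterion above can be illustrated with a short calculation. This is a sketch under stated assumptions: the organ names, the gram values and both function names are hypothetical and serve only to show how the 50%, 80% and 90% thresholds would be applied.

```python
def expressing_biomass_fraction(biomass_g: dict, expressing: set) -> float:
    """Fraction of vegetative biomass found in organs with detectable expression.

    `biomass_g` maps each vegetative organ to its biomass in grams; seeds and
    flowers must already be excluded. `expressing` names the organs in which
    expression was detected.
    """
    total = sum(biomass_g.values())
    if total <= 0:
        raise ValueError("total vegetative biomass must be positive")
    return sum(m for organ, m in biomass_g.items() if organ in expressing) / total


def is_substantially_all(fraction: float, threshold: float = 0.5) -> bool:
    """True if the fraction exceeds the threshold (0.5; preferably 0.8 or 0.9)."""
    return fraction > threshold


# Hypothetical measurements, for illustration only.
biomass = {"leaf": 6.0, "stem": 3.0, "root": 1.0}
frac = expressing_biomass_fraction(biomass, {"leaf", "stem"})
print(frac, is_substantially_all(frac), is_substantially_all(frac, 0.9))
```

In this hypothetical case the expressing organs account for 90% of vegetative biomass, which satisfies the 50% and 80% thresholds but not the "more than 90%" threshold.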

The term "comparatively low" with respect to expression in the seed and/or flower
tissues or organs means that the expression rate realized by the transgenic expression
construct (as measured by any of the methods given below, or exemplified in the
Examples; preferably by a quantitative β-glucuronidase assay) and normalized to units of
β-glucuronidase per gram of biomass in seed and/or flower tissue is less than 10% of
the corresponding value in total vegetative plant tissues, preferably less than 5%.
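
As a minimal sketch of this normalization, assuming GUS activity has already been measured in units per gram of biomass (the numerical values and function names below are hypothetical):

```python
def seed_to_vegetative_ratio(gus_seed_per_g: float, gus_vegetative_per_g: float) -> float:
    """Ratio of GUS units per gram in seed/flower tissue to that in vegetative tissue."""
    if gus_vegetative_per_g <= 0:
        raise ValueError("vegetative reference value must be positive")
    return gus_seed_per_g / gus_vegetative_per_g


def is_comparatively_low(ratio: float, limit: float = 0.10) -> bool:
    """True if the ratio is below the limit (10%; preferably 5%)."""
    return ratio < limit


# Hypothetical assay values in GUS units per gram of biomass.
ratio = seed_to_vegetative_ratio(4.0, 100.0)
print(ratio, is_comparatively_low(ratio), is_comparatively_low(ratio, 0.05))
```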

In a preferred embodiment of the invention, a promoter activity is considered
essentially the same, especially, when the expression rate of a promoter decreases during
development (i.e. the promoter activity in the tissues and organs at the vegetative
stages is relatively stronger than that at the reproductive stages).

In an even more preferred embodiment of the invention, a promoter activity is
considered essentially the same, especially, when the expression rate of a promoter is
especially high in non-differentiated or de-differentiated tissue or cells like, e.g., callus
culture.

The expression level of a functional equivalent homolog promoter may be lower or
higher when compared with a reference value obtained by a promoter as described by
SEQ ID NO: 1 or 2 in a specific tissue (although the expression pattern remains
essentially the same). Preferred sequences are those whose expression level, measured on
the basis of the transcribed mRNA or the protein which is translated as a consequence,
differs quantitatively by not more than 50%, preferably 25%, especially preferably 10%,
from a reference value obtained with the promoter described by SEQ ID NO: 1 or 2,
under otherwise unchanged conditions.

Functional equivalent homologs also comprise those promoter sequences whose
function, compared with the ptxA or SbHRGP3 promoter as shown in SEQ ID NO: 1 or 2, is
reduced or increased. In this context, the promoter activity is at least 50% higher,
preferably at least 100% higher, especially preferably at least 300% higher, very especially
preferably at least 500% higher than a reference value obtained with the ptxA or
SbHRGP3 promoter as shown in SEQ ID NO: 1 or 2 under otherwise unchanged
conditions. Preferably, the activity falls short of that of the ptxA or SbHRGP3 promoter as
shown in SEQ ID NO: 1 or 2 by not more than 80%, preferably not more than 50%,
especially preferably not more than 20%, very especially preferably not more than
10%.

The term "promoter activity" when made in reference to a nucleic acid sequence refers
to the ability of the nucleic acid sequence to initiate transcription of an operably linked
nucleotide sequence into mRNA. The terms "operably linked," "in operable
combination," and "in operable order" as used herein refer to the linkage of nucleic acid
sequences in a manner such that a nucleic acid molecule is capable of directing the
transcription of a nucleic acid sequence of interest and/or the synthesis of a polypeptide
sequence of interest. Promoter activity may be determined using methods known in the
art. For example, a candidate nucleotide sequence whose promoter activity is to be
determined is ligated in-frame to a nucleic acid sequence of interest (e.g., a reporter
gene sequence, a selectable marker gene sequence) to generate a reporter vector,
the reporter vector is introduced into plant tissue using methods described herein, and
the expression of the reporter gene is detected (e.g., by detecting the presence of encoded
mRNA or encoded protein, or the activity of a protein encoded by the reporter gene).
The reporter gene may express visible markers. Reporter gene systems which express
visible markers include β-glucuronidase and its substrate (X-Gluc), luciferase and its
substrate (luciferin), and β-galactosidase and its substrate (X-Gal), which are widely
used not only to identify transformants, but also to quantify the amount of transient or
stable protein expression attributable to a specific vector system (Rhodes 1995). In a
preferred embodiment, the reporter gene is a GUS gene. The selectable marker gene
may confer antibiotic or herbicide resistance. Examples of selectable marker genes
include, but are not limited to, the dhfr gene, which confers resistance to methotrexate
(Wigler 1980); npt, which confers resistance to the aminoglycosides neomycin and G-418
(Colbere-Garapin 1981); and als or pat, which confer resistance to chlorsulfuron and
phosphinothricin, respectively. Detecting the presence of encoded
mRNA or encoded protein, or the activity of a protein encoded by the reporter gene or
the selectable marker gene, indicates that the candidate nucleotide sequence has
promoter activity.

The term "otherwise unchanged conditions" means - for example - that the expression
which is initiated by one of the expression constructs to be compared is not modified by
combination with additional genetic control sequences, for example enhancer
sequences, and is done in the same environment (e.g., the same plant species) at the
same developmental stage and under the same growing conditions.


Mutations encompass substitutions, additions, deletions, inversions or insertions of one
or more nucleotide residues. Thus, those nucleotide sequences which are obtained by
modification of the ptxA or SbHRGP3 promoter as shown in SEQ ID NO: 1 or 2 are
also encompassed by the present invention. The aim of such a modification may be the
further delimitation of the sequence comprised therein or else, for example, the
introduction of further cleavage sites for restriction enzymes, the removal of excess DNA or
the addition of further sequences, for example further regulatory sequences.

Where insertions, deletions or substitutions such as, for example, transitions and
transversions are suitable, techniques known per se such as in vitro mutagenesis, primer
repair, restriction or ligation may be used. In the case of suitable manipulations such
as, for example, restriction, chewing back or filling in overhangs for blunt ends,
complementary ends of the fragments may be provided for ligation. Analogous results may
also be achieved using the polymerase chain reaction (PCR) using specific
oligonucleotide primers.

Functional equivalent homologs of a promoter sequence as described by SEQ ID NO:
1 (for example by substitution, insertion or deletion of nucleotides; or representing a
homologous promoter from another plant species) have at least 60% homology,
preferably at least 80% homology, by preference at least 90% homology, especially
preferably at least 95% homology, very especially preferably at least 98% homology - but
less than 100% homology - to the promoter sequence as described by SEQ ID NO: 1,
wherein said homology is determined over a sequence of at least 700 consecutive
base pairs, preferably at least 800 consecutive base pairs, more preferably at least 850
consecutive base pairs of the sequence as described by SEQ ID NO: 1, and have
essentially the same promoter activity characteristics as the ptxA promoter as shown in
SEQ ID NO: 1.

In a preferred embodiment, functional equivalent homologs of a promoter sequence
as described by SEQ ID NO: 1 (for example by substitution, insertion or deletion of
nucleotides; or representing a homologous promoter from another plant species) have
at least 90% homology, preferably at least 95% homology, by preference at least 97%
homology, especially preferably at least 98% homology, very especially preferably at
least 99% homology - but less than 100% homology - to the promoter sequence as
described by SEQ ID NO: 1, wherein said homology is determined over a sequence of
at least 300 consecutive base pairs, preferably at least 400 consecutive base pairs,
more preferably at least 500 consecutive base pairs of the sequence as described by
SEQ ID NO: 1, and have essentially the same promoter activity characteristics as
the ptxA promoter as shown in SEQ ID NO: 1.

In a more preferred embodiment, functional equivalent homologs of a promoter
sequence as described by SEQ ID NO: 1 (for example by substitution, insertion or
deletion of nucleotides; or representing a homologous promoter from another plant species)
have at least 95% homology, preferably at least 96% homology, by preference at least
97% homology, especially preferably at least 98% homology, very especially preferably
at least 99% homology - but less than 100% homology - to the promoter sequence as
described by SEQ ID NO: 1, wherein said homology is determined over a sequence of
at least 100 consecutive base pairs, preferably at least 200 consecutive base pairs,
more preferably at least 500 consecutive base pairs of the sequence as described by
SEQ ID NO: 1, and have essentially the same promoter activity characteristics as
the ptxA promoter as shown in SEQ ID NO: 1.

Functional equivalent homologs of a promoter sequence as described by SEQ ID NO: 1
are not to be understood to include the promoter of the MsPRP2 gene from alfalfa
(Bastola 1998; WO 99/53016).

Functional equivalent homologs of a promoter sequence as described by SEQ ID NO: 2
(for example by substitution, insertion or deletion of nucleotides; or representing a
homologous promoter from another plant species) have at least 60% homology,
preferably at least 80% homology, by preference at least 90% homology, especially
preferably at least 95% homology, very especially preferably at least 98% homology - but
less than 100% homology - to the promoter sequence as described by SEQ ID NO: 2,
wherein said homology is determined over a sequence of at least 100 consecutive
base pairs, preferably at least 200 consecutive base pairs, more preferably at least 500
consecutive base pairs of the sequence as described by SEQ ID NO: 2, and have
essentially the same promoter activity characteristics as the SbHRGP3 promoter as
shown in SEQ ID NO: 2.
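
The homology thresholds above are stated over a minimum stretch of consecutive base pairs. One way to illustrate such a windowed comparison is sketched below. It assumes the two sequences have already been aligned by some pairwise alignment program (gaps written as '-'); the function name and the rule that gap columns count as mismatches are assumptions of this sketch, not definitions taken from the specification.

```python
def best_window_identity(aligned_a: str, aligned_b: str, window: int) -> float:
    """Highest percent identity over any `window` consecutive aligned columns.

    Both inputs must come from the same pairwise alignment (equal length,
    '-' for gaps). A column counts as identical only if both characters are
    the same nucleotide; gap columns count as mismatches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not 0 < window <= len(aligned_a):
        raise ValueError("window must be between 1 and the alignment length")
    matches = [int(x == y and x != "-")
               for x, y in zip(aligned_a.upper(), aligned_b.upper())]
    current = sum(matches[:window])
    best = current
    # Slide the window one column at a time, updating the match count.
    for i in range(window, len(matches)):
        current += matches[i] - matches[i - window]
        best = max(best, current)
    return 100.0 * best / window


# Toy sequences for illustration only; not taken from the sequence listing.
print(best_window_identity("ACGTACGTAC", "ACGTTCGTAC", 10))
```

A candidate would meet, for example, the "at least 95% over at least 100 consecutive base pairs" criterion if the value returned for a window of 100 is 95.0 or higher.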

Further examples of promoter sequences employed in the expression constructs or
expression vectors according to the invention can be found readily in different
organisms whose genomic sequence is known such as, for example, Arabidopsis thaliana,
Brassica napus, Nicotiana tabacum, Solanum tuberosum, Helianthus annuus, Linum
sativum from databases by homology alignment.

Functional equivalents are furthermore understood as meaning DNA sequences which
hybridize under high stringency conditions with the nucleic acid sequence encoding the
ptxA or SbHRGP3 promoter as shown in SEQ ID NO: 1 or 2 or the nucleic acid
sequences complementary thereto and which have essentially the same promoter
activity. Preferred are promoter sequences which hybridize under high stringency conditions
(as defined above) with a fragment of at least 50 consecutive nucleotides, preferably at
least 100 consecutive nucleotides, more preferably at least 200 consecutive nucleotides,
most preferably at least 500 consecutive nucleotides of a sequence as described by
SEQ ID NO: 1 or 2 (or a fragment of the same preferred length of the complementary
strand of the sequence as described by SEQ ID NO: 1 or 2). In a preferred
embodiment this fragment is selected starting from the putative transcription start and the
length is calculated in upstream direction (i.e. away from the corresponding ATG
codon).

Methods for the generation of artificial functional equivalent homologs according to the
invention preferably comprise the introduction of mutations into the ptxA or SbHRGP3
promoter as shown in SEQ ID NO: 1 or 2. Mutagenesis can be random, the
mutagenized sequences subsequently being screened in a "trial-and-error" procedure
for their characteristics. Methods for the mutagenic treatment of nucleic acid
sequences are known to the skilled worker and include, for example, the use of
oligonucleotides with one or more mutations in comparison with the region to be mutated (for
example in a "site-specific mutagenesis"). Typically, primers with approximately 15 to
approximately 75 nucleotides or more are employed, with approximately 10 to
approximately 25 or more nucleotide residues preferably being located on both sides of
the sequence to be modified. Details and the procedure of said mutagenesis methods
are known to the skilled worker (Kunkel 1987; Tomic 1990; Upender 1995; US
4,237,224). A mutagenesis can also be carried out by treating for example vectors
comprising one of the nucleic acid sequences according to the invention with mutagens
such as hydroxylamine.
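
The primer geometry described above (a modified position flanked on both sides by unchanged template nucleotides) can be sketched as follows. This is an illustration only: the function name, the default flank of 12 nucleotides and the toy template are assumptions, and a real design would also consider melting temperature and secondary structure.

```python
def mutagenic_primer(template: str, pos: int, new_base: str, flank: int = 12) -> str:
    """Primer carrying one substitution at 1-based `pos`, with `flank`
    unchanged template nucleotides on each side of the substituted base."""
    template = template.upper()
    new_base = new_base.upper()
    if len(new_base) != 1 or new_base not in "ACGT":
        raise ValueError("new_base must be one of A, C, G, T")
    i = pos - 1  # convert to 0-based index
    if i - flank < 0 or i + flank >= len(template):
        raise ValueError("not enough template sequence on both sides of pos")
    if template[i] == new_base:
        raise ValueError("new_base equals the template base; nothing to mutate")
    return template[i - flank:i] + new_base + template[i + 1:i + 1 + flank]


# Toy template for illustration; not a sequence from the sequence listing.
toy_template = "ACGT" * 10
primer = mutagenic_primer(toy_template, pos=20, new_base="A")
print(primer, len(primer))
```

With a flank of 12 the primer is 25 nucleotides long, which lies inside the ranges given in the text (15 to 75 nucleotides overall, 10 to 25 on each side of the modified sequence).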

Naturally occurring functional equivalent homologs can be identified and isolated either
starting from the promoter sequences as described by SEQ ID NO: 1 or 2 or -
alternatively - by starting from the corresponding protein encoding sequences. The latter
normally show significantly higher homologies and allow for easier identification
of corresponding genes in other plant species.

Examples of functional equivalent promoter sequences which can be employed in the
transgenic expression cassettes of the invention can be identified and/or isolated from
organisms whose genomic sequence is known (e.g., Arabidopsis thaliana, Brassica
napus, Nicotiana tabacum, Solanum tuberosum, Helianthus annuus, Linum sativum)
by homology search in the corresponding databases. Preferably, the person skilled in
the art will start such analysis based on the coding regions of the genes whose
promoters are described by SEQ ID NO: 1 or 2.

Naturally occurring functional equivalent promoter sequences can be identified and
isolated by multiple methods known in the art. For example, probes or primers derived
from either the promoter sequences as described by SEQ ID NO: 1 or 2 or the
corresponding protein encoding sequences can be employed to screen libraries of genomic
DNA clones.

As used herein, the term "probe" refers to an oligonucleotide, whether occurring
naturally as in a purified restriction digest or produced synthetically, recombinantly or by
PCR amplification, which is capable of hybridizing to a nucleotide sequence of interest.
A probe may be single-stranded or double-stranded. It is contemplated that any probe
used in the present invention will be labeled with any "reporter molecule," so that it is
detectable in any detection system including, but not limited to, enzyme (e.g., ELISA, as
well as enzyme-based histochemical assays), fluorescent, radioactive, colorimetric,
gravimetric, magnetic, and luminescent systems. It is not intended that the present
invention be limited to any particular detection system or label.

The probes provided herein are useful in the detection, identification and isolation of,
for example, sequences such as those listed as SEQ ID NOs: 1 or 2 as well as of
homologs thereof. Preferred probes are of sufficient length (e.g., from about 9 nucleotides
to about 20 nucleotides or more in length) such that high stringency hybridization may
be employed. In one embodiment, probes from 20 to 50 nucleotide bases in length are
employed.

Similarly, a portion of the nucleic acid sequences set forth as SEQ ID NOs: 1 or 2 can be
used as a primer for the amplification of nucleic acid sequences useful as functional
equivalent homologs by, for example, polymerase chain reaction (PCR) or reverse
transcription-polymerase chain reaction (RT-PCR). The term "amplification" is defined
as the production of additional copies of a nucleic acid sequence and is generally
carried out using polymerase chain reaction technologies well known in the art
(Dieffenbach 1995). With PCR, it is possible to amplify a single copy of a specific target
sequence in genomic DNA to a level detectable by several different methodologies (e.g.,
hybridization with a labeled probe; incorporation of biotinylated primers followed by
avidin-enzyme conjugate detection; and/or incorporation of 32P-labeled
deoxyribonucleotide triphosphates, such as dCTP or dATP, into the amplified segment).

As used herein, the term "primer" refers to an oligonucleotide, whether occurring
naturally as in a purified restriction digest or produced synthetically, which is capable of
acting as a point of initiation of synthesis when placed under conditions in which
synthesis of a primer extension product which is complementary to a nucleic acid strand is
induced (i.e., in the presence of nucleotides and an inducing agent such as DNA
polymerase and at a suitable temperature and pH). The primer is preferably single
stranded for maximum efficiency in amplification, but may alternatively be double
stranded. If double stranded, the primer is first treated to separate its strands before
being used to prepare extension products. Preferably, the primer is an
oligodeoxyribonucleotide. The primer must be sufficiently long (e.g., from about 9 nucleotides to about
20 nucleotides or more in length) to prime the synthesis of extension products in the
presence of the inducing agent. Suitable lengths of the primers may be empirically
determined and depend on factors such as temperature, source of primer and the use of
the method. In one embodiment, the present invention employs primers from 20 to 50
nucleotide bases in length.

The invention also contemplates functional equivalent fragments of SEQ ID NOs: 1 or 2
(and functional equivalent homologs thereof) having essentially the same promoter
activity. Functional equivalent fragments of the ptxA or SbHRGP3 promoter can be
produced preferably by eliminating (deleting) non-essential sequences and restricting the
original sequence to those regions comprising promoter elements affecting promoter activity,
but without adversely affecting the abovementioned characteristics to a significant
extent. Such functional equivalent fragments are also termed "core promoter" or "core
promoter region" herein.

Sequences within a promoter which affect promoter activity may be determined by
using deletion constructs such as those described by Sherri et al. (US 5,593,874). Briefly,
several expression plasmids are constructed to contain a reporter gene under the
regulatory control of different candidate nucleotide sequences which are obtained either by
restriction enzyme deletion of internal sequences in SEQ ID NOs: 1 or 2, restriction
enzyme truncation of sequences at the 5' and/or 3' end of SEQ ID NOs: 1 or 2, or by the
introduction of single nucleic acid base changes by PCR into SEQ ID NOs: 1 or 2.
Expression of the reporter gene by the deletion constructs is detected. Detection of
expression of the reporter gene in a given deletion construct indicates that the candidate
nucleotide sequence in that deletion construct has promoter activity.
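
A 5' truncation series of the kind described above can be enumerated in silico before any cloning is planned. The sketch below is illustrative only: the function name, the step size and the minimum length are arbitrary choices, and the 863-nucleotide placeholder string stands in for a real promoter sequence, which is not reproduced here.

```python
def five_prime_truncations(promoter: str, step: int, min_length: int) -> list:
    """Return (start, fragment) pairs for a 5' deletion series.

    `start` is the 1-based position of the first retained base. Every fragment
    keeps the 3' end of `promoter`; successive fragments lose `step` more bases
    from the 5' end, stopping before fewer than `min_length` bases remain.
    """
    if step <= 0 or min_length <= 0:
        raise ValueError("step and min_length must be positive")
    series = []
    for offset in range(0, len(promoter) - min_length + 1, step):
        series.append((offset + 1, promoter[offset:]))
    return series


# Placeholder sequence of 863 nt; NOT the sequence of SEQ ID NO: 1.
placeholder = ("ACGT" * 216)[:863]
series = five_prime_truncations(placeholder, step=100, min_length=280)
for start, fragment in series:
    print(start, len(fragment))
```

Each fragment would then be placed upstream of the reporter gene, and loss of reporter expression between two neighbouring constructs points to a functionally relevant region in the deleted interval.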

Alternatively or in combination, restricting (cutting down) the ptxA or SbHRGP3
promoter sequence to specific essential regulatory regions can also be achieved with the
aid of a search routine for promoter elements. Specific promoter elements
are frequently accumulated in the regions which are relevant for promoter activity. This 
analysis can be carried out for example with computer programs such as the program 
PLACE ("Plant Cis-acting Regulatory DNA Elements") (Higo 1999). 
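
The kind of search such a routine performs can be sketched as a simple motif scan. The
code below is not the PLACE program and does not use its database; the motif
"TATAWA" is an illustrative TATA-box-like pattern chosen for this example, and the
test sequence is a hypothetical placeholder.

```python
import re

# IUPAC nucleotide codes expanded to regular-expression character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def find_motif(sequence, motif):
    """Return 1-based (start, end) positions of every motif match.

    Overlapping matches are reported; only the given strand is searched.
    """
    pattern = "".join(IUPAC[base] for base in motif.upper())
    # A zero-width lookahead allows overlapping matches to be found.
    regex = re.compile(f"(?=({pattern}))")
    hits = []
    for match in regex.finditer(sequence.upper()):
        start = match.start() + 1
        hits.append((start, start + len(motif) - 1))
    return hits


# Hypothetical placeholder sequence, not taken from the sequence listing.
print(find_motif("GGGCTATAAATAGGC", "TATAWA"))  # [(5, 10)]
```

Regions in which many such hits cluster would be candidates for the core promoter;
regions without hits would be candidates for deletion.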

The core region of the ptxA promoter described by SEQ ID NO: 1 was determined by
promoter element analysis based on the PLACE algorithm (see Example 14). Based on
the PLACE results given below, a potential TATA box is localized at base pair 549 to
base pair 554 of SEQ ID NO: 1. In consequence, the 5' untranslated region starts at
about base pair 584 and extends to base pair 863 of SEQ ID NO: 1. The sequence
described by SEQ ID NO: 1 ends just before the ATG start codon. It is known to the
person skilled in the art that the 5' untranslated region is not part of the promoter.
Therefore, this 5' untranslated region may be deleted to obtain a functionally equivalent
fragment of the ptxA promoter as described by SEQ ID NO: 1. Based on the promoter
element analysis, there seem to be no clusters of promoter elements in the first 300
base pairs of the sequence described by SEQ ID NO: 1. Therefore, this region may also
be deleted to obtain a functionally equivalent fragment of the ptxA promoter as described
by SEQ ID NO: 1. It is therefore very likely that the core region of the ptxA promoter
extends from about base pair 300 to about base pair 583 of the sequence described by
SEQ ID NO: 1.

Therefore, in a preferred embodiment of the invention a functional equivalent promoter
fragment of the ptxA promoter as described by SEQ ID NO: 1 comprises a sequence
described by the sequence of about base pair 300 to about 863 of SEQ ID NO: 1 or the
sequence of about base pair 1 to about 583 of SEQ ID NO: 1, preferably the sequence
of about base pair 300 to about 863 of SEQ ID NO: 1.

The core region of the SbHRGP3 promoter described by SEQ ID NO: 2 was determined
by promoter element analysis based on the PLACE algorithm (see Example 15).
Based on the PLACE results given below, a potential TATA box is localized at base
pair 1147 to base pair 1152 of SEQ ID NO: 2. In consequence, the 5' untranslated
region starts at about base pair 1179 and extends to base pair 1380 of SEQ ID NO: 2.
The sequence described by SEQ ID NO: 2 ends 12 base pairs before the ATG start
codon. It is known to the person skilled in the art that the 5' untranslated region is not
part of the promoter. Therefore, this 5' untranslated region may be deleted to obtain a
functionally equivalent fragment of the SbHRGP3 promoter as described by SEQ ID NO:
2. Based on the promoter element analysis, there seem to be no clusters of promoter
elements in the first 800 base pairs of the sequence described by SEQ ID NO: 2.
Therefore, this region may also be deleted to obtain a functionally equivalent fragment of
the SbHRGP3 promoter as described by SEQ ID NO: 2. It is therefore very likely that the
core region of the SbHRGP3 promoter extends from about base pair 800 to about base
pair 1179 of the sequence described by SEQ ID NO: 2.

Therefore, in a preferred embodiment of the invention a functional equivalent promoter
fragment of the SbHRGP3 promoter as described by SEQ ID NO: 2 comprises a
sequence described by the sequence of about base pair 800 to about 1380 of SEQ ID
NO: 2 or the sequence of about base pair 1 to about 1179 of SEQ ID NO: 2, preferably
the sequence of about base pair 800 to about 1179 of SEQ ID NO: 2.
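
The base pair coordinates stated above can be collected in a small lookup table, which
makes it straightforward to cut the corresponding fragments out of a sequence and to
check their lengths. The following is a sketch only: the dictionary keys and function
names are hypothetical, the coordinates are the approximate ("about") values from the
text, and the sequences themselves must be supplied from the sequence listing.

```python
# Approximate 1-based, inclusive coordinates as stated in the text.
REGIONS = {
    "ptxA": {  # SEQ ID NO: 1
        "tata_box": (549, 554),
        "five_prime_utr": (584, 863),
        "core_region": (300, 583),
    },
    "SbHRGP3": {  # SEQ ID NO: 2
        "tata_box": (1147, 1152),
        "five_prime_utr": (1179, 1380),
        "core_region": (800, 1179),
    },
}


def subsequence(sequence, start_bp, end_bp):
    """Return base pairs start_bp..end_bp (1-based, inclusive)."""
    if not 1 <= start_bp <= end_bp <= len(sequence):
        raise ValueError("coordinates outside the sequence")
    return sequence[start_bp - 1:end_bp]


def region_lengths(regions):
    """Return the length in base pairs of every named region."""
    return {name: end - start + 1 for name, (start, end) in regions.items()}


for promoter, regions in REGIONS.items():
    print(promoter, region_lengths(regions))
```

With these values, the ptxA core region spans 284 base pairs and the SbHRGP3 core
region 380 base pairs.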

The nucleotide sequence of SEQ ID NOs: 1 or 2, fragments, homologs and antisense 
sequences thereof may be synthesized by synthetic chemistry techniques which are 
commercially available and well known in the art (see Caruthers 1980; Horn 1980).

Additionally, fragments of SEQ ID NOs:1 or 2 can be made by treatment of SEQ ID
NOs:1 or 2 with restriction enzymes followed by purification of the fragments by gel
electrophoresis. Alternatively, sequences may also be produced using the polymerase
chain reaction (PCR) as described by Mullis [US 4,683,195, 4,683,202 and 4,965,188,
all of which are hereby incorporated by reference]. SEQ ID NOs:1 or 2, portions,
homologs and antisense sequences thereof may be ligated to each other or to
heterologous nucleic acid sequences using methods well known in the art.

The nucleotide sequence of synthesized sequences may be confirmed using commercially
available kits as well as using methods well known in the art which utilize
enzymes such as the Klenow fragment of DNA polymerase I, Sequenase®, Taq DNA
polymerase, or thermostable T7 polymerase. Capillary electrophoresis may also be used
to analyze the size and confirm the nucleotide sequence of the products of nucleic acid 
synthesis, restriction enzyme digestion or PCR amplification. 

THE EXPRESSION CONSTRUCT OF THE INVENTION

In the transgenic expression construct of the invention one or more of the promoter 
sequences described above are operably linked (as defined above) to a nucleic acid of 
interest. 

Beside the promoter sequence and the nucleic acid of interest operably linked thereto,
the expression construct of the invention may comprise further genetic control
sequences (as defined below in detail). For example, at the 3' end of the nucleic acid
sequence of interest, other DNA sequences may also be included, e.g., a 3' untranslated
region containing a polyadenylation site and transcription termination sites. Further
sequences which, for example, act as a linker with specific cleavage sites for restriction
enzymes, or as a signal peptide, may be positioned between the promoter and the
nucleic acid sequence of interest. For example, an expression construct according to the
invention is generated by fusing the ptxA or SbHRGP3 promoter as shown in SEQ ID
NO: 1 or 2 (or a functional equivalent or functionally equivalent portion thereof) to a
nucleic acid sequence to be expressed, and a terminator signal or polyadenylation signal.

A transgenic expression cassette of the invention (or a transgenic vector comprising
said transgenic expression cassette) can be produced by means of customary
recombination and cloning techniques as are described (for example, in Maniatis 1989;
Silhavy 1984; and in Ausubel 1987).

However, a transgenic expression construct of the invention is also understood as
meaning those constructs in which a promoter of the invention is introduced into a host
genome without previously having been linked operably to a nucleic acid sequence of
interest to be expressed (e.g., via directed homologous recombination or random
insertion), and then, in this host genome, takes on regulatory control over the nucleic acid
sequences to which it is now linked operably, and governs the transgenic expression of
the latter. By inserting the promoter, for example by homologous recombination,
upstream of an endogenous nucleic acid encoding a specific polypeptide, an expression
construct according to the invention is obtained which governs the expression of the
specific polypeptide in the vegetative plant tissues. Furthermore, insertion of the
promoter may also be effected in such a manner that RNA which is antisense to the
nucleic acid encoding a specific polypeptide is expressed. This selectively down-regulates
or switches off expression of the specific polypeptide in the vegetative plant tissues.

Analogously, a nucleic acid sequence of interest to be expressed recombinantly may
also be placed downstream of the endogenous natural ptxA or SbHRGP3 promoter (or
a functionally equivalent homolog thereof in another plant species), for example by
homologous recombination, whereby an expression construct according to the invention
is obtained which governs the expression of the nucleic acid sequence to be
expressed recombinantly in the cotyledons of the plant embryo.

FURTHER GENETIC CONTROL SEQUENCES

The transgenic expression construct of the invention may comprise further genetic
control sequences in addition to the inventive promoter. The term "genetic control
sequences" is to be understood in the broad sense and refers to all those sequences
which have an effect on the materialization, production, propagation, replication, or the
function of the expression construct according to the invention. For example, genetic
control sequences modify the transcription and translation in prokaryotic or eukaryotic
organisms. Preferably, the expression constructs according to the invention encompass
a promoter functional in plants 5'-upstream of the nucleic acid sequence in question
to be expressed recombinantly, and 3'-downstream a terminator sequence as additional
genetic control sequence and, if appropriate, further customary regulatory elements,
in each case linked operably to the nucleic acid sequence to be expressed
recombinantly.

Genetic control sequences furthermore also encompass the 5'-untranslated regions,
introns or non-coding 3'-region of genes, such as, for example, the actin-1 intron, or the
Adh1-S introns 1, 2 and 6 (general reference: The Maize Handbook, Chapter 116,
Freeling and Walbot, Eds., Springer, New York (1994)). It has been demonstrated that
they may play a significant role in the regulation of gene expression. Thus, it has been
demonstrated that 5'-untranslated sequences can enhance the transient expression of
heterologous genes. Examples of translation enhancers which may be mentioned are
the tobacco mosaic virus 5' leader sequence (Gallie 1987) and the like. Furthermore,
they may promote tissue specificity (Rouster 1998).

The expression construct may advantageously comprise one or more enhancer
sequences, linked operably to the promoter, which make possible an increased
recombinant expression of the nucleic acid sequence. Additional advantageous sequences,
such as further regulatory elements or terminators, may also be inserted at the 3' end
of the nucleic acid sequences to be expressed recombinantly. Polyadenylation signals
which are suitable as control sequences are plant polyadenylation signals, preferably
those which essentially correspond to T-DNA polyadenylation signals from
Agrobacterium tumefaciens, in particular the OCS (octopine synthase) terminator and the
NOS (nopaline synthase) terminator.

Control sequences are furthermore understood as meaning those sequences which
make possible a homologous recombination or insertion into the genome of a host
organism or which permit the removal from the genome. In the case of homologous
recombination, for example, the ptxA or SbHRGP3 promoter may be substituted for the
natural promoter of an endogenous gene. Using homologous recombination, a
promoter of the invention can be placed before the target gene to be transgenically
expressed (e.g., an endogenous plant gene), by linking said promoter to DNA sequences
which are homologous to, for example, endogenous sequences upstream of the
reading frame of the target gene. Such sequences count as genetic control sequences.
After a cell has been transformed with the DNA construct in question, the two
homologous sequences can interact and thus place the promoter of the invention at the
desired site before the target gene so that the promoter sequence of the invention
becomes operably linked to the target gene and constitutes an expression construct of
the invention. The choice of the homologous sequences determines the insertion type
of the promoter. In this case the expression construct can be generated by homologous
recombination by means of a singly- or doubly-reciprocal recombination. In the case of
the singly-reciprocal recombination, only an individual recombination sequence is used,
and all of the DNA introduced is inserted. In the case of the doubly-reciprocal
recombination, the DNA to be introduced is flanked by two homologous sequences, and
the flanked region is inserted. The latter method is suitable for substituting the ptxA or
SbHRGP3 promoter for the natural promoter of a specific gene, as described above,
and thus modifying the natural expression profile of this gene. This operable linkage
constitutes an expression construct according to the invention.
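
The outcome of a doubly-reciprocal recombination, in which the region between two
homologous sequences is exchanged, can be modelled at the level of plain strings. This
is only a sketch of the end result and says nothing about recombination frequency or
mechanism; the function name and all sequences below are hypothetical placeholders.

```python
def replace_between_arms(genome, left_arm, right_arm, insert):
    """Replace the region between two homology arms with `insert`.

    The arms themselves are kept unchanged, which mirrors a donor
    construct of the form left_arm + insert + right_arm.
    """
    left = genome.find(left_arm)
    if left == -1:
        raise ValueError("left homology arm not found")
    left_end = left + len(left_arm)
    right = genome.find(right_arm, left_end)
    if right == -1:
        raise ValueError("right homology arm not found downstream of left arm")
    return genome[:left_end] + insert + genome[right:]


# Hypothetical locus: upstream arm, native promoter, start of the reading frame.
locus = "CCCC" + "AAAAGG" + "TTTTTT" + "GGATCC" + "ATGAAA"
modified = replace_between_arms(locus, "AAAAGG", "GGATCC", "ACGTACGT")
print(modified)
```

Here "TTTTTT" stands in for the natural promoter and "ACGTACGT" for a promoter of
the invention; after the exchange the new promoter sits directly upstream of the
unchanged reading frame.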

Homologous recombination is a relatively rare event in higher eukaryotes, especially in
plants. Random integrations into the host genome predominate. A possibility of
removing the randomly integrated sequences and thus accumulating cell clones with a
correct homologous recombination is the use of a sequence-specific recombination
system as described in US 6,110,736.

Control sequences are furthermore to be understood as those permitting removal of the
inserted sequences from the genome. Methods based on the cre/lox (Sauer 1998;
Odell 1990; Dale 1991), FLP/FRT (Lysnik 1993), or Ac/Ds system (Wader 1987; US
5,225,341; Baker 1987; Lawson 1994) permit a - if appropriate tissue-specific and/or
inducible - removal of a specific DNA sequence from the genome of the host organism.
Control sequences may in this context mean the specific flanking sequences (e.g., lox
sequences), which later allow removal (e.g., by means of cre recombinase). In this
case, the specific flanking sequences are attached to the target gene.
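
The effect of flanking a sequence with two directly repeated recognition sites and later
removing it can be illustrated with a short string-level model. This is a sketch under
stated assumptions: the 34-bp site below is the commonly cited loxP sequence and should
be checked against the literature before use, the function name is hypothetical, and
the model ignores site orientation and recombination efficiency.

```python
# Commonly cited 34-bp loxP site (13-bp repeat, 8-bp spacer, 13-bp repeat);
# treated here as an assumption for illustration.
LOXP = "ATAACTTCGTATAGCATACATTATACGAAGTTAT"


def excise_between_sites(genome, site=LOXP):
    """Model recombinase-mediated excision between two directly repeated sites.

    The segment between the first two occurrences of `site` is removed
    together with one copy of the site, so a single site remains.
    The genome is returned unchanged if fewer than two sites are present.
    """
    first = genome.find(site)
    if first == -1:
        return genome
    second = genome.find(site, first + len(site))
    if second == -1:
        return genome
    return genome[:first + len(site)] + genome[second + len(site):]


# Hypothetical construct: flanking DNA, site, marker placeholder, site, flanking DNA.
construct = "GGGG" + LOXP + "ATGCCC" + LOXP + "TTTT"
print(excise_between_sites(construct) == "GGGG" + LOXP + "TTTT")  # True
```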

Furthermore, other elements having influence on the performance of an expression
construct or a vector are included under the term control sequences. Such control
sequences may include

a) Origins of replication, which ensure amplification of the expression constructs or
vectors according to the invention in, for example, E. coli. Examples which may be
mentioned are ORI (origin of DNA replication), the pBR322 ori or the P15A ori
(Maniatis 1989).

b) Elements which are necessary for Agrobacterium-mediated plant transformation,
such as, for example, the right or left border of the T-DNA or the vir region.

c) Multiple cloning regions (MCS), which permit and facilitate the insertion of one or
more nucleic acid sequences.

Control sequences further comprise sequences which allow for transport of the
expressed protein into specific cell compartments, such as, for example, the
endomembrane system, the vacuole, or the plastids (e.g., the chloroplasts). Desired
glycosylation reactions, specific folding and the like, are possible by exploiting the
secretory pathway. Alternative possibilities are the secretion of the target protein towards
the cell surface or secretion into the culture medium, for example when using cells or
protoplasts grown in suspension culture. The targeting sequences required for this
purpose can be incorporated into the expression construct or vector of the invention in
combination with the nucleic acid sequence of interest. Targeting sequences which can
be used are homologous (with respect to the nucleic acid of interest - if present) or
heterologous sequences. Targeting sequences are known for subcellular localization in
apoplasts, vacuole, plastids, mitochondrion, endoplasmic reticulum (ER), nucleus,
elaioplasts, and other compartments. The method for the targeted transport into plastids
of proteins which per se are not localized in the plastids is described (Klosgen & Weil
1991; Van Breusegem 1998).

Genetic control sequences also encompass further promoters, promoter elements or
minimal promoters, all of which can modify or enhance the expression-governing
characteristics. Thus, for example, the tissue-specific expression may additionally depend
on certain stresses, owing to genetic control sequences. Such elements have been
described, for example, for water stress, abscisic acid (Lam & Chua 1991) and thermal
stress (Schöffl 1989). For example, expression from a dicotyledonous promoter can be
made feasible in monocotyledonous plants by combining the promoter with an intron.
Such an intron is generally fused into the 5' untranslated region and may or may not be
spliced out of the transcript; its presence enhances expression (Callis 1987; Clancy &
Hannah 2002; Le 2003; Lorkovic 2000; Luehrsen & Walbot 1991; McElroy & Wu US
6,429,357).

Further promoters which make possible an expression in further plant tissues or in
other organisms such as, for example, E. coli bacteria, may furthermore be linked
operably to the nucleic acid sequence to be expressed. Suitable plant promoters are, in
principle, all of the above-described promoters. For example, it is feasible that a
specific nucleic acid sequence is transcribed by a promoter (for example the ptxA or
SbHRGP3 promoter) as sense RNA in a plant tissue and translated into the
corresponding protein, while the same nucleic acid sequence is transcribed by another
promoter with another specificity in another tissue into antisense RNA and the
corresponding protein is down-regulated. This can be effected by an expression construct
according to the invention, by positioning the first promoter before the nucleic acid
sequence to be expressed recombinantly, and the other promoter behind it.

PREFERRED NUCLEIC ACID OF INTEREST

Preferably, the transgenic expression construct of the invention to be inserted into the
genome of the target plant comprises at least one expression construct, which may -
for example - facilitate expression of selection markers, trait genes, antisense RNA or
double-stranded RNA. Preferably said expression constructs comprise a promoter
sequence functional in plant cells (either - and preferably - a promoter of the invention or
another suitable promoter as for example described in the BACKGROUND OF THE
INVENTION) operatively linked to a nucleic acid sequence which - upon expression -
confers an advantageous phenotype to the so transformed plant. The person skilled in
the art is aware of numerous sequences which may be utilized in this context, e.g. to
increase quality of food and feed, to produce chemicals, fine chemicals or
pharmaceuticals (e.g., vitamins, oils, carbohydrates; Dunwell 2000), to confer resistance to
herbicides, or to confer male sterility. Furthermore, growth, yield, and resistance against
abiotic and biotic stress factors (e.g., fungi, viruses, nematodes, or insects) may be
enhanced. Advantageous properties may be conferred either by overexpressing
proteins or by decreasing expression of endogenous proteins by, e.g., expressing a
corresponding antisense (Sheehy 1988; US 4,801,340; Mol 1990) or double-stranded RNA
(Matzke 2000; Fire 1998; Waterhouse 1998; WO 99/32619; WO 99/53050;
WO 00/68374; WO 00/44914; WO 00/44895; WO 00/49035; WO 00/63364). Nucleic
acids of interest may encode the following (but shall not be limited to):

1. Selection markers 

Selection markers are useful to select and separate successfully transformed or
homologously recombined cells.

1.1 Positive selection markers 

Selection markers confer a resistance to a biocidal compound such as a metabolic
inhibitor (e.g., 2-deoxyglucose-6-phosphate, WO 98/45456), antibiotics (e.g., kanamycin,
G418, bleomycin or hygromycin) or herbicides (e.g., phosphinothricin or glyphosate).
Especially preferred selection markers are those which confer resistance to herbicides.
Examples which may be mentioned are:

- Phosphinothricin acetyltransferases (PAT; also named Bialaphos® resistance; bar;
de Block 1987; EP 0 333 033; US 4,975,374)

- 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) conferring resistance to
Glyphosate® (N-(phosphonomethyl)glycine) (Shah 1986)

- Glyphosate® degrading enzymes (Glyphosate® oxidoreductase; gox)

- Dalapon® inactivating dehalogenases (deh)

- Sulfonylurea- and imidazolinone-inactivating acetolactate synthases (for example
mutated ALS variants with, for example, the S4 and/or Hra mutation)

- Bromoxynil® degrading nitrilases (bxn)

- Kanamycin- or G418-resistance genes (NPTII; NPTI) coding, e.g., for neomycin
phosphotransferases (Fraley 1983)

- 2-Desoxyglucose-6-phosphate phosphatase (DOGR1 gene product; WO 98/45456;
EP 0 807 836) conferring resistance against 2-desoxyglucose (Randez-Gil 1995)

- Hygromycin phosphotransferase (HPT), which mediates resistance to hygromycin
(Vanden Elzen 1985)

- Dihydrofolate reductase (Eichholtz 1987)



Additional positive selectable marker genes of bacterial origin that confer resistance to
antibiotics include the aadA gene, which confers resistance to the antibiotic
spectinomycin, gentamycin acetyl transferase, streptomycin phosphotransferase (SPT),
aminoglycoside-3-adenyl transferase and the bleomycin resistance determinant (Hayford
1988; Jones 1987; Svab 1990; Hille 1986).

Genes like isopentenyltransferase from Agrobacterium tumefaciens (strain: P022;
Genbank Acc.-No.: AB025109) may - as a key enzyme of the cytokinin biosynthesis -
facilitate regeneration of transformed plants (e.g., by selection on cytokinin-free
medium). Corresponding selection methods are described (Ebinuma 2000a; Ebinuma
2000b). Additional positive selection markers, which confer a growth advantage to a
transformed plant in comparison with a non-transformed one, are described e.g., in
EP-A 0 601 092. Growth stimulation selection markers may include (but shall not be
limited to) β-glucuronidase (in combination with e.g., a cytokinin glucuronide),
mannose-6-phosphate isomerase (in combination with mannose), UDP-galactose-4-epimerase
(in combination with e.g., galactose), wherein mannose-6-phosphate isomerase in
combination with mannose is especially preferred.

1.2 Negative selection markers

Negative selection markers are especially suitable to select organisms with defined
deleted sequences comprising said marker (Koprek 1999). Examples of negative
selection markers comprise thymidine kinases (TK), cytosine deaminases (Gleave 1999;
Perera 1993; Stougaard 1993), cytochrome P450 proteins (Koprek 1999), haloalkane
dehalogenases (Naested 1999), iaaH gene products (Sundaresan 1995), cytosine
deaminase codA (Schlaman & Hooykaas 1997), or tms2 gene products (Fedoroff &
Smith 1993).

2. Reporter genes

Reporter genes encode readily quantifiable proteins and, via their color or enzyme
activity, make possible an assessment of the transformation efficacy, the site of
expression or the time of expression. Very especially preferred in this context are genes
encoding reporter proteins (Schenborn 1999) such as the green fluorescent protein (GFP)
(Sheen 1995; Haseloff 1997; Reichel 1996; Tian 1997; WO 97/41228; Chui 1996;
Leffel 1997), chloramphenicol acetyltransferase, a luciferase (Ow 1986; Millar 1992), the
aequorin gene (Prasher 1985), β-galactosidase, the R locus gene (encoding a protein
which regulates the production of anthocyanin pigments (red coloring) in plant tissue
and thus makes possible the direct analysis of the promoter activity without addition of
further auxiliary substances or chromogenic substrates; Dellaporta 1988; Ludwig
1990), with β-glucuronidase (GUS) being very especially preferred (Jefferson
1987a,b). β-glucuronidase (GUS) expression is detected by a blue color on incubation
of the tissue with 5-bromo-4-chloro-3-indolyl-β-D-glucuronic acid; bacterial luciferase
(LUX) expression is detected by light emission; firefly luciferase (LUC) expression is
detected by light emission after incubation with luciferin; and galactosidase expression
is detected by a bright blue color after the tissue is stained with
5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside. Reporter genes may also be used as
scorable markers as alternatives to antibiotic resistance markers. Such markers are used
to detect the presence or to measure the level of expression of the transferred gene. The
use of scorable markers in plants to identify or tag genetically modified cells works well
only when efficiency of modification of the cell is high.

The skilled worker is familiar with a multiplicity of nucleic acids of interest (or proteins
encoded thereby) whose transgenic expression is advantageous. The skilled worker is
furthermore familiar with a multiplicity of genes by whose repression or silencing by
means of expression of a corresponding antisense or double-stranded RNA
advantageous effects may also be achieved. The following may be mentioned by way of
example, but not by way of limitation, as advantageous effects:

- Obtaining a resistance to abiotic stresses (high and low temperatures, drought, in- 
creased humidity, environmental toxins, UV radiation) 

- Obtaining a resistance to biotic stresses (pathogens, viruses, insects and diseases)

- Obtaining resistance against phytotoxic substances or herbicides 

- Improving the growth rate or the yield. 



The following may be mentioned by way of example but not by way of limitation as
nucleic acid sequences or polypeptides which can be used for these applications:

1. Improved protection of the plant embryo against abiotic stresses such as drought,
high or low temperatures, for example by overexpressing the antifreeze polypeptides
from Myoxocephalus scorpius (WO 00/00512), Myoxocephalus octodecemspinosus,
the Arabidopsis thaliana transcription activator CBF1, glutamate dehydrogenases
(WO 97/12983, WO 98/11240), a late embryogenesis gene (LEA), for example
from barley (WO 97/13843), calcium-dependent protein kinase genes
(WO 98/26045), calcineurins (WO 99/05902), farnesyl transferases (WO 99/06580,
Pei 1998), ferritin (Deak 1999), oxalate oxidase (WO 99/04013; Dunwell 1998),
DREB1A factor (dehydration response element B 1A; Kasuga 1999), mannitol or
trehalose synthesis genes, such as trehalose-phosphate synthase or
trehalose-phosphate phosphatase (WO 97/42326), or by inhibiting genes such as the
trehalase gene (WO 97/50561). Especially preferred nucleic acids are those which
encode the transcriptional activator CBF1 from Arabidopsis thaliana (GenBank Acc.
No.: U77378) or the Myoxocephalus octodecemspinosus antifreeze protein
(GenBank Acc. No.: AF306348), or functional equivalents of these.



2. Obtaining resistance for example against fungi, insects, nematodes and diseases by
the targeted secretion or concentration of specific metabolites or proteins in the
embryo epidermis. Examples which may be mentioned are glucosinolates (defence
against herbivores), chitinases or glucanases and other enzymes which destroy the
cell wall of parasites, ribosome-inactivating proteins (RIPs) and other proteins of the
plant's resistance and stress response as are induced upon wounding or microbial
attack of plants or chemically by, for example, salicylic acid, jasmonic acid or
ethylene; lysozymes from nonplant sources such as, for example, T4 lysozyme or
lysozyme from a variety of mammals, insecticidal proteins such as Bacillus
thuringiensis endotoxin, α-amylase inhibitor or protease inhibitors (cowpea trypsin
inhibitor), glucanases, lectins such as phytohemagglutinin, snowdrop lectin,
wheat-germ agglutinin, RNAses or ribozymes. Nucleic acids which are especially preferred
are those which encode the Trichoderma harzianum chit42 endochitinase (GenBank
Acc. No.: S78423) or the Sorghum bicolor N-hydroxylating multifunctional
cytochrome P-450 (CYP79) proteins (GenBank Acc. No.: U32624), or functional
equivalents of these.

The transgenic expression constructs of the invention can be employed for suppressing
or reducing expression of endogenous target genes by "gene silencing". Preferred
genes or proteins whose suppression brings about an advantageous phenotype are
known to the skilled worker. Examples may include but are not limited to
down-regulation of the β-subunit of Arabidopsis G protein for increasing root mass (Ullah et
al. 2003), inactivating cyclic nucleotide-gated ion channel (CNGC) for improving
disease resistance (WO 2001007596), and down-regulation of 4-coumarate-CoA ligase
(4CL) gene for altering lignin and cellulose contents (US 2002138870).

Gene silencing can be realized by antisense or double-stranded RNA or by co- 
suppression (sense-suppression). An "antisense" nucleic acid is firstly understood as 
meaning a nucleic acid sequence which is fully or partially complementary to at least 
part of the "sense" strand of said target protein. The skilled worker knows that he can
alternatively use the cDNA or the corresponding gene as starting template for suitable
antisense constructs. The "antisense" nucleic acid is preferably complementary to the
coding region of the target protein or part thereof. However, the "antisense" nucleic 
acid may also be complementary to the non-coding region or part thereof. Starting from 
the sequence information on a target protein, an antisense nucleic acid can be de- 
signed in the manner with which the skilled worker is familiar, taking into consideration 
Watson's and Crick's rules of base pairing. An antisense nucleic acid can be comple- 
mentary to the entire or part of the nucleic acid sequence of a target protein. 
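
Applying the base-pairing rules mentioned above amounts to taking the reverse
complement of the sense strand. The sketch below shows this for a short placeholder
sequence; the function names and the example sequence are hypothetical and only
unambiguous bases (A, C, G, T) are handled.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence (5'->3')."""
    unknown = set(dna) - set("ACGTacgt")
    if unknown:
        raise ValueError(f"unsupported characters: {sorted(unknown)}")
    return dna.translate(_COMPLEMENT)[::-1]


def antisense_rna(sense_dna):
    """Return the antisense transcript (5'->3') pairing with the sense strand."""
    return reverse_complement(sense_dna).upper().replace("T", "U")


# Hypothetical six-base fragment of a coding (sense) strand.
print(reverse_complement("ATGGCC"))  # GGCCAT
print(antisense_rna("ATGGCC"))       # GGCCAU
```

An antisense construct covering only part of the target would use the reverse
complement of the corresponding partial sequence.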

Likewise encompassed is the use of the above-described sequences in sense orienta- 
tion, which, as is known to the skilled worker, can lead to co-suppression (sense- 
suppression). It has been demonstrated that expression of sense sequences can reduce
or switch off expression of the same gene, analogously to what has been described for
antisense approaches (Goring 1991; Smith 1990; Napoli 1990; Van der Krol 1990). In this
context,
the construct introduced may represent the gene to be reduced fully or only in part. The 
possibility of translation is not necessary. 

Especially preferred is the use of gene regulation methods by means of double- 
stranded RNAi ("double-stranded RNA interference"). Such methods are known to the 
person skilled in the art (e.g., Matzke 2000; Fire 1998; WO 99/32619; WO 99/53050; 
WO 00/68374; WO 00/44914; WO 00/44895; WO 00/49035; WO 00/63364). The proc- 
esses and methods described in the references stated are expressly referred to. 

Furthermore, artificial transcription factors (e.g. of the zinc finger protein type; Beerli 
2000) can be expressed under control of a promoter of the invention to modulate ex- 
pression of specific endogenous genes. These factors attach to the regulatory regions 
of the endogenous genes to be expressed or to be repressed and, depending on the 
design of the factor, bring about expression or repression of the endogenous gene. 

TARGET ORGANISM 

Another subject matter of the invention relates to transgenic organisms transformed 
with at least one transgenic expression construct or vector of the invention, and to 
cells, cell cultures, tissues, organs (e.g., leaves, roots and the like in the case of plant 
organisms), or propagation material derived from such organisms.

The terms "organism", "target organism" or "host organism" are preferably understood
as meaning prokaryotic or eukaryotic organisms, such as, for example, microorganisms 
or plant organisms. Preferred microorganisms are bacteria, yeasts, algae or fungi. 

Preferred bacteria are bacteria of the genus Escherichia, Erwinia, Agrobacterium,
Flavobacterium, Alcaligenes or cyanobacteria, for example of the genus Synechocystis.
Especially preferred are microorganisms which are capable of infecting plants and thus 
of transferring the constructs according to the invention. Preferred microorganisms are 
those from the genus Agrobacterium and, in particular, the species Agrobacterium tu- 
mefaciens. 

Preferred yeasts are Candida, Saccharomyces, Hansenula or Pichia. Preferred fungi 
are Aspergillus, Trichoderma, Ashbya, Neurospora, Fusarium, Beauveria or other 
fungi. Plant organisms are furthermore, for the purposes of the invention, other
organisms which are capable of photosynthetic activity such as, for example, algae or
cyanobacteria, and also mosses. Preferred algae are green algae such as, for example,
algae of the genus Haematococcus, Phaeodactylum tricornutum, Volvox or Dunaliella.

Host or target organisms which are preferred as transgenic organisms are especially 
plants. Included within the scope of the invention are all genera and species of higher 
and lower plants of the plant kingdom. Included are furthermore the mature plants, 
seeds, shoots and seedlings and parts, propagation material and cultures derived
therefrom, for example cell cultures. The term "mature plants" is understood as meaning
plants at any developmental stage beyond the seedling. The term "seedling" is
understood as meaning a young, immature plant in an early developmental stage.

Annual, biennial, monocotyledonous and dicotyledonous plants are preferred host
organisms for the generation of transgenic plants. The expression of genes is
furthermore advantageous in all ornamental plants, useful or ornamental trees, flowers,
cut flowers, shrubs or lawns. Plants which may be mentioned by way of example but not
by limitation are angiosperms; bryophytes such as, for example, Hepaticae (liverworts)
and Musci (mosses); pteridophytes such as ferns, horsetail and club mosses;
gymnosperms such as conifers, cycads, ginkgo and Gnetatae; algae such as Chlorophyceae,
Phaeophyceae, Rhodophyceae, Myxophyceae, Xanthophyceae, Bacillariophyceae
(diatoms) and Euglenophyceae.

Preferred are plants which are used for food or feed purposes, such as the family of the
Leguminosae, such as pea, alfalfa and soya; Gramineae such as rice, maize, wheat,
barley, sorghum, millet, rye, triticale, or oats; the family of the Umbelliferae, especially
the genus Daucus, very especially the species carota (carrot), and Apium, very
especially the species graveolens dulce (celery), and many others; the family of the
Solanaceae, especially the genus Lycopersicon, very especially the species esculentum
(tomato), and the genus Solanum, very especially the species tuberosum (potato) and
melongena (egg plant), and many others (such as tobacco); and the genus Capsicum,
very especially the species annuum (peppers), and many others; the family of the
Leguminosae, especially the genus Glycine, very especially the species max
(soybean), alfalfa, pea, lucerne, beans or peanut, and many others; and the family of the
Cruciferae (Brassicaceae), especially the genus Brassica, very especially the species
napus (oil seed rape), campestris (beet), oleracea cv Tastie (cabbage), oleracea cv
Snowball Y (cauliflower) and oleracea cv Emperor (broccoli); and of the genus
Arabidopsis, very especially the species thaliana, and many others; the family of the
Compositae, especially the genus Lactuca, very especially the species sativa (lettuce),
and many others; the family of the Asteraceae such as sunflower, Tagetes, lettuce or
Calendula, and many others; the family of the Cucurbitaceae such as melon,
pumpkin/squash or zucchini; and linseed. Further preferred are cotton, sugar cane, hemp,
flax, chillies, and the various tree, nut and wine species.

Very especially preferred are Arabidopsis thaliana, Nicotiana tabacum, Tagetes erecta,
Calendula officinalis, Glycine max, Zea mays, Oryza sativa, Triticum aestivum, Pisum
sativum, Phaseolus vulgaris, Hordeum vulgare, Brassica napus.

TRANSGENIC EXPRESSION VECTORS 

An expression construct according to the invention can advantageously be introduced
into cells, preferably into plant cells, using vectors. In an advantageous embodiment,
the expression construct is introduced by means of plasmid vectors. In one embodiment,
the methods of the invention involve transformation of organisms or cells (e.g.
plants or plant cells) with a transgenic expression vector comprising at least a
transgenic expression cassette of the invention (as described above). As used herein, the
terms "vector" and "vehicle" are used interchangeably in reference to nucleic acid
molecules that transfer DNA segment(s) from one cell to another. The term "expression
vector" as used herein refers to a recombinant DNA molecule containing a desired
coding sequence and appropriate nucleic acid sequences necessary for the expression of
the operably linked coding sequence in a particular host organism.

The methods of the invention are not limited to the expression vectors disclosed herein.
Any expression vector which is capable of introducing a nucleic acid sequence of
interest into a plant cell is contemplated to be within the scope of this invention. Typically,
expression vectors comprise the transgenic expression cassette of the invention in
combination with elements which allow cloning of the vector into a bacterial or phage
host. The vector preferably, though not necessarily, contains an origin of replication
which is functional in a broad range of prokaryotic hosts. A selectable marker is
generally, but not necessarily, included to allow selection of cells bearing the desired vector.

Examples of vectors may be plasmids, cosmids, phages, viruses or Agrobacteria. More 
specific examples are given below for the individual transformation technologies. 

Preferred are those vectors which make possible a stable integration of the expression
construct into the host genome. In the case of injection or electroporation of DNA into
plant cells, the plasmid used need not meet any particular requirements. Simple
plasmids such as those of the pUC series can be used. If intact plants are to be
regenerated from the transformed cells, it is necessary for an additional selectable marker
gene to be present on the plasmid. A variety of possible plasmid vectors are available
for the introduction of foreign genes into plants, and these plasmid vectors contain, as
a rule, a replication origin for multiplication in E. coli and a marker gene for the selection
of transformed bacteria. Examples are pBR322, pUC series, M13mp series,
pACYC184 and the like.

The expression construct can be introduced into the vector via a suitable restriction
cleavage site. The plasmid formed is first introduced into E. coli. Correctly transformed
E. coli are selected and grown, and the recombinant plasmid is obtained by methods
known to the skilled worker. Restriction analysis and sequencing can be used for
verifying the cloning step.

Depending on the method by which DNA is introduced, further genes may be neces- 
sary on the vector plasmid. 

Agrobacterium tumefaciens and A. rhizogenes are plant-pathogenic soil bacteria, which
genetically transform plant cells. The Ti and Ri plasmids of A. tumefaciens and A.
rhizogenes, respectively, carry genes responsible for genetic transformation of the
plant (Kado 1991). Vectors of the invention may be based on the Agrobacterium Ti- or
Ri-plasmid and may thereby utilize a natural system of DNA transfer into the plant
genome.

As part of this highly developed parasitism, Agrobacterium transfers a defined part of its
genomic information (the T-DNA; flanked by about 25 bp repeats, named left and right
border) into the chromosomal DNA of the plant cell (Zupan 2000). Said DNA transfer is
mediated by the combined action of the so-called vir genes (part of the original
Ti-plasmids). For utilization of this natural system, Ti-plasmids were developed which lack
the original tumor inducing genes ("disarmed vectors"). In a further improvement, the
so-called "binary vector systems", the T-DNA was physically separated from the other
functional elements of the Ti-plasmid (e.g., the vir genes) by being incorporated into a
shuttle vector, which allowed easier handling (EP-A 120 516; US 4,940,838). These
binary vectors comprise (besides the disarmed T-DNA with its border sequences)
prokaryotic sequences for replication both in Agrobacterium and E. coli. It is an advantage
of Agrobacterium-mediated transformation that in general only the DNA flanked by the
borders is transferred into the genome and that preferentially only one copy is inserted.

Descriptions of Agrobacterium vector systems and methods for Agrobacterium-mediated
gene transfer are known in the art (Miki 1993; Gruber 1993; Moloney 1989).
The use of T-DNA for the transformation of plant cells has been studied and described
intensively (EP 120516; Hoekema 1985; Fraley 1985; and An 1985). Various binary
vectors are known, some of which are commercially available such as, for example,
pBIN19 (Clontech Laboratories, Inc. U.S.A.).

Hence, for Agrobacteria-mediated transformation the transgenic expression construct
of the invention is integrated into specific plasmids, either into a shuttle or intermediate
vector, or into a binary vector. If a Ti or Ri plasmid is to be used for the transformation,
at least the right border, but in most cases the right and left border, of the Ti or Ri
plasmid T-DNA is linked to the transgenic expression construct to be introduced in the
form of a flanking region. Binary vectors are preferably used. Binary vectors are
capable of replication both in E. coli and in Agrobacterium. They may comprise a selection
marker gene and a linker or polylinker (for insertion of e.g. the expression construct to
be transferred) flanked by the right and left T-DNA border sequence. They can be
transferred directly into Agrobacterium (Holsters 1978). The selection marker gene
permits the selection of transformed Agrobacteria and is, for example, the nptII gene,
which confers resistance to kanamycin. The Agrobacterium which acts as host organism
in this case should already contain a plasmid with the vir region. The latter is
required for transferring the T-DNA to the plant cell. An Agrobacterium transformed in this
way can be used for transforming plant cells. The use of T-DNA for transforming plant
cells has been studied and described intensively (EP 120 516; Hoekema 1985; An
1985; see also below).

Common binary vectors are based on "broad host range" plasmids like pRK252 (Bevan
1984) or pTJS75 (Watson 1985) derived from the P-type plasmid RK2. Most of these
vectors are derivatives of pBIN19 (Bevan 1984). Various binary vectors are known,
some of which are commercially available such as, for example, pBI101.2 or pBIN19
(Clontech Laboratories, Inc. USA). Additional vectors were improved with regard to size
and handling (e.g. pPZP; Hajdukiewicz 1994). Improved vector systems are also
described in WO 02/00900.

In a preferred embodiment, Agrobacterium strains for use in the practice of the
invention include octopine strains, e.g., LBA4404, or agropine strains, e.g., EHA101 or
EHA105. Suitable strains of A. tumefaciens for DNA transfer are for example
EHA101[pEHA101] (Hood 1986), EHA105[pEHA105] (Li 1992), LBA4404[pAL4404]
(Hoekema 1983), C58C1[pMP90] (Koncz 1986), and C58C1[pGV2260] (Deblaere
1985). Other suitable strains are Agrobacterium tumefaciens C58 (a nopaline strain),
A. tumefaciens C58C1 (Van Larebeke 1974), A136 (Watson 1975) or LBA4011
(Klapwijk 1980). In a preferred embodiment, the Agrobacterium strain used to
transform the plant tissue pre-cultured with the plant phenolic compound
contains a L,L-succinamopine type Ti-plasmid, preferably disarmed, such as pEHA101.
In another preferred embodiment, the Agrobacterium strain used to transform the plant
tissue pre-cultured with the plant phenolic compound contains an octopine-type
Ti-plasmid, preferably disarmed, such as pAL4404. Generally, when using octopine-type
Ti-plasmids or helper plasmids, it is preferred that the virF gene be deleted or
inactivated (Jarschow 1991). In a preferred embodiment, the Agrobacterium strain is used to
transform plant tissue pre-cultured with a plant phenolic compound such as
acetosyringone. The method of the invention can also be used in combination with
particular Agrobacterium strains, to further increase the transformation efficiency, such as
Agrobacterium strains wherein the vir gene expression and/or induction thereof is
altered due to the presence of mutant or chimeric virA or virG genes (e.g. Hansen 1994;
Chen 1991; Scheeren-Groot 1994).

A binary vector or any other vector can be modified by common DNA recombination
techniques, multiplied in E. coli, and introduced into Agrobacterium by e.g.,
electroporation or other transformation techniques (Mozo 1991). Agrobacterium is grown and
used as described in the art. The Agrobacterium strain comprising the vector may, for
example, be grown for 3 days on YP medium (5 g/L yeast extract, 10 g/L peptone, 5 g/L
NaCl, 15 g/L agar, pH 6.8) supplemented with the appropriate antibiotic (e.g., 50 mg/L
spectinomycin). Bacteria are collected with a loop from the solid medium and
resuspended.
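
By way of illustration only, the YP medium composition stated above can be scaled to an arbitrary batch volume as in the following minimal sketch. The function names and the 250 mL batch volume in the usage note are assumptions made for the example and are not taken from the description; only the per-litre amounts are.

```python
# Per-litre composition of solid YP medium as stated in the text (grams per litre).
YP_MEDIUM_G_PER_L = {
    "yeast extract": 5.0,
    "peptone": 10.0,
    "NaCl": 5.0,
    "agar": 15.0,
}

# Example antibiotic concentration from the text (mg per litre of medium).
SPECTINOMYCIN_MG_PER_L = 50.0


def scale_recipe(volume_ml: float) -> dict[str, float]:
    """Return the grams of each solid component needed for `volume_ml` of medium."""
    if volume_ml <= 0:
        raise ValueError("volume must be positive")
    litres = volume_ml / 1000.0
    return {name: grams * litres for name, grams in YP_MEDIUM_G_PER_L.items()}


def antibiotic_mg(volume_ml: float, mg_per_l: float = SPECTINOMYCIN_MG_PER_L) -> float:
    """Return the milligrams of antibiotic needed for `volume_ml` of medium."""
    if volume_ml <= 0:
        raise ValueError("volume must be positive")
    return mg_per_l * volume_ml / 1000.0
```

For instance, scale_recipe(250) gives the component masses for a hypothetical 250 mL batch. The pH adjustment to 6.8 is not modelled here.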

TRANSFORMATION TECHNIQUES

The generation of a transformed organism or a transformed cell requires introducing
the DNA in question into the host cell in question. A multiplicity of methods is available
for this procedure, which is termed transformation (see also Keown (1990) Methods in
Enzymology 185:527-537). For example, the DNA can be introduced directly by
microinjection or by bombardment with DNA-coated microparticles. Also, the cell can be
permeabilized chemically, for example using polyethylene glycol, so that the DNA can
enter the cell by diffusion. The DNA can also be introduced by protoplast fusion with other
DNA-containing units such as minicells, cells, lysosomes or liposomes. Another
suitable method of introducing DNA is electroporation, where the cells are permeabilized
reversibly by an electrical pulse.

Methods for introduction of a transgenic expression construct or vector into plant tissue
may include but are not limited to, e.g., electroinjection (Nan 1995; Griesbach 1992);
fusion with liposomes, lysosomes, cells, minicells or other fusible lipid-surfaced bodies
(Fraley 1982); polyethylene glycol (Krens 1982); chemicals that increase free DNA
uptake; transformation using virus, and the like. Furthermore, the biolistic method with the
gene gun, electroporation, incubation of dry embryos in DNA-containing solution, and
microinjection may be employed.

Protoplast-based methods can be employed (e.g., for rice), where DNA is delivered to
the protoplasts through liposomes, PEG, or electroporation (Shimamoto 1989; Datta
1990b). Transformation by electroporation involves the application of short,
high-voltage electric fields to create "pores" in the cell membrane through which DNA is
taken up. These methods are, for example, used to produce stably transformed
monocotyledonous plants (Paszkowski 1984; Shillito 1985; Fromm 1986), especially
from rice (Shimamoto 1989; Datta 1990b; Hayakawa 1992).

Particle bombardment or "biolistics" is a widely used method for the transformation of
plants, especially monocotyledonous plants. In the "biolistics" (microprojectile-mediated
DNA delivery) method, microprojectile particles are coated with DNA and accelerated
by a mechanical device to a speed high enough to penetrate the plant cell wall and
nucleus (WO 91/02071). The foreign DNA gets incorporated into the host DNA and
results in a transformed cell. There are many variations on the "biolistics" method
(Sanford 1990; Fromm 1990; Christou 1988; Sautter 1991). The method has been used to
produce stably transformed monocotyledonous plants including rice, maize, wheat,
barley, and oats (Christou 1991; Gordon-Kamm 1990; Vasil 1992, 1993; Wan 1994;
Sommers 1992).

In addition to these "direct" transformation techniques, transformation can also be
effected by bacterial infection by means of Agrobacterium tumefaciens or Agrobacterium
rhizogenes. These strains contain a plasmid (Ti or Ri plasmid) which is transferred to
the plant following Agrobacterium infection. Part of this plasmid, termed T-DNA
(transferred DNA), is integrated into the genome of the plant cell (see above for description
of vectors). To transfer the DNA to the plant cell, plant explants are cocultured with a
transgenic Agrobacterium tumefaciens or Agrobacterium rhizogenes. Starting from
infected plant material (for example leaf, root or stem sections, but also protoplasts or
suspensions of plant cells), intact plants can be generated using a suitable medium
which may contain, for example, antibiotics or biocides for selecting transformed cells.

The plants obtained can then be screened for the presence of the DNA introduced, in
this case the expression construct according to the invention. As soon as the DNA has
integrated into the host genome, the genotype in question is, as a rule, stable and the
insertion in question is also found in the subsequent generations. As a rule, the
expression construct integrated contains a selection marker which imparts to the
transformed plant a resistance to a biocide (for example a herbicide) or an antibiotic such as
kanamycin, G 418, bleomycin, hygromycin or phosphinothricin and the like. The selection
marker permits the selection of transformed cells from untransformed cells (McCormick
1986). The plants obtained can be cultured and hybridized in the customary fashion.
Two or more generations should be grown in order to ensure that the genomic
integration is stable and hereditary. The abovementioned methods are described, for
example, in Jenes 1993 and in Potrykus 1991.

One of skill in the art knows that the efficiency of transformation by Agrobacterium may
be enhanced by using a number of methods known in the art. For example, the
inclusion of a natural wound response molecule such as acetosyringone (AS) in the
Agrobacterium culture has been shown to enhance transformation efficiency with
Agrobacterium tumefaciens (Shahla 1987). Alternatively, transformation efficiency may be
enhanced by wounding the target tissue to be transformed. Wounding of plant tissue may
be achieved, for example, by punching, maceration, bombardment with
microprojectiles, etc. (see, e.g., Bidney 1992).

A number of other methods have been reported for the transformation of plants
(especially monocotyledonous plants) including, for example, the "pollen tube method" (WO
93/18168; Luo 1988), macro-injection of DNA into floral tillers (Du 1989; De la Pena
1987), injection of Agrobacterium into developing caryopses (WO 00/63398), and
tissue incubation of seeds in DNA solutions (Topfer 1989). Direct injection of exogenous
DNA into the fertilized plant ovule at the onset of embryogenesis was disclosed in WO
94/00583. WO 97/48814 disclosed a process for producing stably transformed fertile
wheat and a system of transforming wheat via Agrobacterium based on freshly isolated
or pre-cultured immature embryos, embryogenic callus and suspension cells.

It may be desirable to target the nucleic acid sequence of interest to a particular locus
on the plant genome. Site-directed integration of the nucleic acid sequence of interest
into the plant cell genome may be achieved by, for example, homologous
recombination using Agrobacterium-derived sequences. Generally, plant cells are incubated with
a strain of Agrobacterium which contains a targeting vector in which sequences that
are homologous to a DNA sequence inside the target locus are flanked by
Agrobacterium transfer-DNA (T-DNA) sequences, as previously described (US 5,501,967, the
entire contents of which are herein incorporated by reference). One of skill in the art
knows that homologous recombination may be achieved using targeting vectors which
contain sequences that are homologous to any part of the targeted plant gene, whether
belonging to the regulatory elements of the gene, or the coding regions of the gene.
Homologous recombination may be achieved at any region of a plant gene so long as
the nucleic acid sequence of regions flanking the site to be targeted is known.

Where homologous recombination is desired, the targeting vector used may be of the
replacement- or insertion-type (US 5,501,967; supra). Replacement-type vectors
generally contain two regions which are homologous with the targeted genomic sequence
and which flank a heterologous nucleic acid sequence, e.g., a selectable marker gene
sequence. Replacement-type vectors result in the insertion of the selectable marker
gene which thereby disrupts the targeted gene. Insertion-type vectors contain a single
region of homology with the targeted gene and result in the insertion of the entire
targeting vector into the targeted gene.
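
The difference between the two vector types described above may be illustrated by the following simplified sketch, in which sequences are modelled as plain character strings. The function names, the toy sequences and the assumption that an insertion-type event duplicates the region of homology on either side of the integrated vector are illustrative simplifications and are not taken from US 5,501,967.

```python
def integrate_replacement(genome: str, left_arm: str, right_arm: str, payload: str) -> str:
    """Replacement-type event: the genomic segment lying between the two
    homology arms is exchanged for the payload (e.g. a selectable marker)."""
    start = genome.find(left_arm)
    if start < 0:
        raise ValueError("left homology arm not found in genome")
    left_end = start + len(left_arm)
    right_start = genome.find(right_arm, left_end)
    if right_start < 0:
        raise ValueError("right homology arm not found downstream of left arm")
    return genome[:left_end] + payload + genome[right_start:]


def integrate_insertion(genome: str, homology: str, vector_rest: str) -> str:
    """Insertion-type event: the entire vector enters at the single region of
    homology, which (in this simplified model) ends up on both sides of it."""
    pos = genome.find(homology)
    if pos < 0:
        raise ValueError("homology region not found in genome")
    cut = pos + len(homology)
    return genome[:cut] + vector_rest + homology + genome[cut:]
```

In the replacement case the original sequence between the arms is lost and the target gene is disrupted by the payload; in the insertion case nothing is deleted, but the locus is interrupted by the whole vector.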

SELECTION OF TRANSGENIC CELLS

Transformed cells, i.e. those which contain the introduced DNA integrated into the DNA
of the host cell, can be selected from untransformed cells if a selectable marker is part
of the introduced DNA. A selection marker gene may confer positive or negative
selection.

A positive selection marker gene may be used in constructs for random integration and
site-directed integration. Positive selection marker genes include antibiotic resistance
genes, herbicide resistance genes and the like. Transformed cells which express
such a marker gene are capable of surviving in the presence of concentrations of the
antibiotic or herbicide in question which kill an untransformed wild type. Examples are
the bar gene, which imparts resistance to the herbicide phosphinothricin (bialaphos;
Vasil 1992; Weeks 1993; Rathore 1993), the nptII gene, which imparts resistance to
kanamycin, the hpt gene, which imparts resistance to hygromycin, or the EPSP gene, which
imparts resistance to the herbicide glyphosate. Further examples are selection with the
aminoglycoside geneticin (G-418) (Nehra 1994) or with glyphosate (Della-Cioppa 1987),
and the ALS gene (chlorsulphuron resistance). Further preferred selectable and
screenable marker genes are disclosed above.

A negative selection marker gene may also be included in the constructs. The use of
one or more negative selection marker genes in combination with a positive selection
marker gene is preferred in constructs used for homologous recombination. Negative
selection marker genes are generally placed outside the regions involved in the
homologous recombination event. The negative selection marker gene serves to provide
a disadvantage (preferably lethality) to cells that have integrated these genes into their
genome in an expressible manner. Cells in which the targeting vectors for homologous
recombination are randomly integrated in the genome will be harmed or killed due to
the presence of the negative selection marker gene. Where a positive selection marker
gene is included in the construct, only those cells having the positive selection marker
gene integrated in their genome will survive. The choice of the negative selection
marker gene is not critical to the invention as long as it encodes a functional
polypeptide in the transformed plant cell. The negative selection gene may for instance be
chosen from the aux-2 gene from the Ti-plasmid of Agrobacterium, the tk-gene from SV40,
cytochrome P450 from Streptomyces griseolus, the Adh gene from maize or
Arabidopsis, etc. Any gene encoding an enzyme capable of converting a substance which is
otherwise harmless to plant cells into a substance which is harmful to plant cells may
be used. Further preferred negative selection markers are disclosed above.
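
The combined positive/negative selection logic described above can be summarized in the following minimal sketch. The class and function names are hypothetical, and the model is deliberately idealized: it assumes that every integrated marker is expressed and that a randomly integrated vector always carries the negative marker along, which need not hold in practice.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cell:
    """Toy model of a plant cell after exposure to a targeting vector."""
    has_positive_marker: bool  # marker located between the regions of homology
    has_negative_marker: bool  # marker located outside the regions of homology


def survives(cell: Cell) -> bool:
    """Apply both selections: cells lacking the positive marker are killed by the
    selective agent, and cells expressing the negative marker are harmed or killed."""
    return cell.has_positive_marker and not cell.has_negative_marker


def interpret(cell: Cell) -> str:
    """Return the most likely origin of a cell under the idealized model."""
    if not cell.has_positive_marker:
        return "untransformed or positive marker absent"
    if cell.has_negative_marker:
        return "random integration of the targeting vector"
    return "candidate homologous recombination event"
```

Under these assumptions only cells carrying the positive marker without the negative marker pass selection, which is why such cells are enriched for homologous recombination events.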

However, insertion of an expression cassette or a vector into the chromosomal DNA
can also be demonstrated and analyzed by various other methods (not based on a
selection marker) known in the art, including, but not limited to, restriction mapping of the
genomic DNA, PCR analysis, DNA-DNA hybridization, DNA-RNA hybridization, DNA
sequence analysis and the like. More specifically, such methods may include e.g., PCR
analysis, Southern blot analysis, fluorescence in situ hybridization (FISH), and in situ
PCR.

REGENERATION OF TRANSGENIC ORGANISM 

As soon as a transformed plant cell has been generated, an intact plant can be
obtained using methods known to the skilled worker. Accordingly, the present invention
provides transgenic plants. The transgenic plants of the invention are not limited to
plants in which each and every cell expresses the nucleic acid sequence of interest
under the control of the promoter sequences provided herein. Included within the scope
of this invention is any plant which contains at least one cell which expresses the
nucleic acid sequence of interest (e.g., chimeric plants). It is preferred, though not
necessary, that the transgenic plant comprises the nucleic acid sequence of interest in more
than one cell, and more preferably in one or more tissues.

Once transgenic plant tissue which contains an expression vector has been obtained,
transgenic plants may be regenerated from this transgenic plant tissue using methods
known in the art. The term "regeneration" as used herein means growing a whole plant
from a plant cell, a group of plant cells, a plant part or a plant piece (e.g., from a
protoplast, callus, protocorm-like body, or tissue part).

Species from the following examples of genera of plants may be regenerated from
transformed protoplasts: Fragaria, Lotus, Medicago, Onobrychis, Trifolium, Trigonella,
Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Arabidopsis, Brassica, Raphanus,
Sinapis, Atropa, Capsicum, Hyoscyamus, Lycopersicon, Nicotiana, Solanum, Petunia,
Digitalis, Majorana, Cichorium, Helianthus, Lactuca, Bromus, Asparagus, Antirrhinum,
Hemerocallis, Nemesia, Pelargonium, Panicum, Pennisetum, Ranunculus, Senecio,
Salpiglossis, Cucumis, Browallia, Glycine, Pisum, Lolium, Zea, Triticum, Sorghum, and
Datura.

For regeneration of transgenic plants from transgenic protoplasts, a suspension of
transformed protoplasts or a Petri plate containing transformed explants is first
provided. Callus tissue is formed and shoots may be induced from callus and
subsequently rooted. Alternatively, somatic embryo formation can be induced in the callus
tissue. These somatic embryos germinate as natural embryos to form plants. The
culture media will generally contain various amino acids and plant hormones, such as
auxins and cytokinins. It is also advantageous to add glutamic acid and proline to the
medium, especially for such species as corn and alfalfa. Efficient regeneration will
depend on the medium, on the genotype, and on the history of the culture. These three
variables may be empirically controlled to result in reproducible regeneration.

Plants may also be regenerated from cultured cells or tissues. Dicotyledonous plants
which have been shown capable of regeneration from transformed individual cells to
obtain transgenic whole plants include, for example, apple (Malus pumila), blackberry
(Rubus), blackberry/raspberry hybrid (Rubus), red raspberry (Rubus), carrot (Daucus
carota), cauliflower (Brassica oleracea), celery (Apium graveolens), cucumber
(Cucumis sativus), eggplant (Solanum melongena), lettuce (Lactuca sativa), potato (Solanum
tuberosum), rape (Brassica napus), wild soybean (Glycine canescens), strawberry
(Fragaria ananassa), tomato (Lycopersicon esculentum), walnut (Juglans regia), melon
(Cucumis melo), grape (Vitis vinifera), and mango (Mangifera indica).
Monocotyledonous plants which have been shown capable of regeneration from transformed individual
cells to obtain transgenic whole plants include, for example, rice (Oryza sativa), rye
(Secale cereale), and maize (Zea mays).

In addition, regeneration of whole plants from cells (not necessarily transformed) has
also been observed in: apricot (Prunus armeniaca), asparagus (Asparagus officinalis),
banana (hybrid Musa), bean (Phaseolus vulgaris), cherry (hybrid Prunus), grape (Vitis
vinifera), mango (Mangifera indica), melon (Cucumis melo), okra (Abelmoschus
esculentus), onion (hybrid Allium), orange (Citrus sinensis), papaya (Carica papaya), peach
(Prunus persica), plum (Prunus domestica), pear (Pyrus communis), pineapple
(Ananas comosus), watermelon (Citrullus vulgaris), and wheat (Triticum aestivum).

The regenerated plants are transferred to standard soil conditions and cultivated in a
conventional manner. After the expression vector is stably incorporated into
regenerated transgenic plants, it can be transferred to other plants by vegetative propagation
or by sexual crossing. For example, in vegetatively propagated crops, the mature
transgenic plants are propagated by the taking of cuttings or by tissue culture
techniques to produce multiple identical plants. In seed propagated crops, the mature
transgenic plants are self-crossed to produce a homozygous inbred plant which is
capable of passing the transgene to its progeny by Mendelian inheritance. The inbred
plant produces seed containing the nucleic acid sequence of interest. These seeds can
be grown to produce plants that would produce the selected phenotype. The inbred
plants can also be used to develop new hybrids by crossing the inbred plant with
another inbred plant to produce a hybrid.
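
The effect of repeated self-crossing on the transgene genotype may be illustrated as follows. This is a minimal sketch assuming a single-locus insertion that segregates in a Mendelian fashion, a primary transformant carrying the transgene on one chromosome only (denoted "Tt"), and no selection between generations; the genotype labels and function names are chosen for the example only.

```python
from fractions import Fraction


def self_cross(freqs: dict[str, Fraction]) -> dict[str, Fraction]:
    """One generation of selfing: homozygotes breed true, while each
    heterozygote yields 1/4 TT, 1/2 Tt and 1/4 tt progeny."""
    het = freqs["Tt"]
    return {
        "TT": freqs["TT"] + het / 4,
        "Tt": het / 2,
        "tt": freqs["tt"] + het / 4,
    }


def selfing_generations(n: int) -> dict[str, Fraction]:
    """Genotype frequencies after n generations of selfing, starting from a
    population consisting only of heterozygous (Tt) primary transformants."""
    if n < 0:
        raise ValueError("number of generations must be non-negative")
    freqs = {"TT": Fraction(0), "Tt": Fraction(1), "tt": Fraction(0)}
    for _ in range(n):
        freqs = self_cross(freqs)
    return freqs
```

Under these assumptions the heterozygous fraction halves with every generation, so that lines homozygous for the transgene can be identified among the progeny after a small number of selfing cycles.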

Confirmation of the transgenic nature of the cells, tissues, and plants may be
performed by PCR analysis, antibiotic or herbicide resistance, enzymatic analysis and/or
Southern blots to verify transformation. Progeny of the regenerated plants may be
obtained and analyzed to verify whether the transgenes are heritable. Heritability of the
transgene is further confirmation of the stable transformation of the transgene in the
plant. The resulting plants can be bred in the customary fashion. Two or more
generations should be grown in order to ensure that the genomic integration is stable and
hereditary. Corresponding methods are described (Jenes 1993; Potrykus 1991).

Also in accordance with the invention are cells, cell cultures, tissues, parts, organs
(such as, for example, roots, leaves and the like in the case of transgenic plant
organisms) derived from the above-described transgenic organisms, and transgenic
propagation material such as seeds or fruits.

Genetically modified plants according to the invention which can be consumed by
humans or animals can also be used as food or feedstuffs, for example directly or
following processes known per se.

A further subject matter of the invention relates to the use of the above-described
transgenic organisms according to the invention and the cells, cell cultures, parts,
tissues, organs (such as, for example, roots, leaves and the like in the case of transgenic
plant organisms) derived from them, and transgenic propagation material such as
seeds or fruits, for the production of foods or feedstuffs, pharmaceuticals or fine
chemicals.

Preferred is furthermore a method for the recombinant production of pharmaceuticals
or fine chemicals in host organisms, where a host organism is transformed with one of
the above-described expression constructs, and this expression construct contains one
or more structural genes which encode the desired fine chemical or catalyze the
biosynthesis of the desired fine chemical, the transformed host organism is cultured, and
the desired fine chemical is isolated from the culture medium. This process can be
used widely for fine chemicals such as enzymes, vitamins, amino acids, sugars, fatty
acids, natural and synthetic flavorings, aroma substances and colorants. Especially
preferred is the production of tocopherols and tocotrienols, carotenoids, oils,
polyunsaturated fatty acids etc. Culturing the transformed host organisms, and isolation
from the host organisms or the culture medium, is performed by methods known to the
skilled worker. The production of pharmaceuticals such as, for example, antibodies,
vaccines, enzymes or pharmaceutically active proteins is described (Hood 1999; Ma
1999; Russel 1999; Cramer 1999; Gavilondo 2000; Holliger 1999).

Sequences 

1. SEQ ID NO: 1 Nucleic acid sequence encoding the ptxA promoter (including the 5'
untranslated region of the ptxA gene)

2. SEQ ID NO: 2 Nucleic acid sequence encoding the SbHRGP3 promoter (including
the 5' untranslated region of the SbHRGP3 gene)

3. SEQ ID NO: 3 Forward primer ptxA5' 5'-GGCGCGCCCGCAATTTTTTGTGAAGC-3'

4. SEQ ID NO: 4 Reverse primer ptxA3' 5'-TCTAGATAAGTTTCGAAGATTTTAG-3'

5. SEQ ID NO: 5 Forward primer SbHRGP3 5'-TCTAGATAGAAGCTTTTCAACAATCATGC-3'

6. SEQ ID NO: 6 Reverse primer SbHRGP3 5'-AGATCTTACTGCCATTAGGAGAGG-3'

7. SEQ ID NO: 7 Nucleic acid sequence encoding functional equivalent homolog of
the SbHRGP3 promoter (including the 5' untranslated region of the SbHRGP3 gene)

8. SEQ ID NO: 8 Nucleic acid sequence encoding functional equivalent homolog of
the SbHRGP3 promoter (including the 5' untranslated region of the SbHRGP3 gene)

9. SEQ ID NO: 9 Nucleic acid sequence encoding functional equivalent homolog of
the SbHRGP3 promoter (including the 5' untranslated region of the SbHRGP3 gene)

10. SEQ ID NO: 10 Nucleic acid sequence encoding chimeric ptxA promoter -
ubiquitin intron construct.

11. SEQ ID NO: 11 Reverse primer-2 ptxA3'-2 5'-TCTAGATAAACTATGAAGCTTTG-3'

Examples
Chemicals 

Unless indicated otherwise, chemicals and reagents in the Examples were obtained
from Sigma Chemical Company (St. Louis, MO), restriction endonucleases were from
New England Biolabs (Beverly, MA) or Roche (Indianapolis, IN), oligonucleotides were
synthesized by MWG Biotech Inc. (High Point, NC), and other modifying enzymes or
kits for biochemical and molecular biological assays were from Clontech (Palo
Alto, CA), Pharmacia Biotech (Piscataway, NJ), Promega Corporation (Madison, WI),
or Stratagene (La Jolla, CA). Materials for cell culture media were obtained from
Gibco/BRL (Gaithersburg, MD) or DIFCO (Detroit, MI). The cloning steps carried out for
the purposes of the present invention, such as, for example, restriction cleavages,
agarose gel electrophoresis, purification of DNA fragments, transfer of nucleic acids to
nitrocellulose and nylon membranes, linking DNA fragments, transformation of E. coli
cells, growing bacteria, multiplying phages and sequence analysis of recombinant
DNA, are carried out as described by Sambrook (1989). The sequencing of recombinant
DNA molecules is carried out using an ABI laser fluorescence DNA sequencer following
the method of Sanger (Sanger 1977).

Example 1: Growth conditions of the plants for tissue-specific RT-PCR analysis
or Northern analysis

In order to obtain 6-day old seedlings, in each case approximately 500 seeds
(Arabidopsis thaliana ecotype Columbia) are surface-sterilized for 2 minutes with a 70%
strength ethanol solution, treated for 2 minutes with a sodium hypochlorite solution (5%
v/v), washed five times with distilled water and incubated for 1 day at 4°C in order to
ensure uniform germination. The seeds are subsequently sown in sterilized containers
(9.7 cm x 9.6 cm x 9 cm) on filter paper soaked in Hoagland's nutrient solution
(modified for Arabidopsis thaliana). Hoagland's solution is prepared with three different
200x stock solutions. Stock solution I comprises 0.5 M Ca(NO3)2, stock solution II
comprises 0.1 M MgSO4, and stock solution III comprises 0.5 M KNO3 and 0.1 M KH2PO4.
Before use, all stock solutions were diluted 1:200 and then mixed 1:1:1. Trace elements
were added by means of a 2000x trace element stock solution (6 x 10^-2 M H3BO3,
4.5 x 10^[?] M MnCl2, 3.8 x 10^[?] M ZnSO4, 3 x 10^[?] M CuSO4, 1 x 10^-4 M
(NH4)6Mo7O24) and a 250x Fe-EDTA stock solution (10 mM FeCl3, 10 mM Na-EDTA). The pH
of the stock solution was then brought to 6.0 using 5 N KOH, and the Hoagland solution
was then autoclaved. The seedlings are grown in the dark at 22°C and harvested 6 days
after the germination phase has begun.
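
The stock arithmetic above can be sketched as follows. This is a minimal illustration that assumes "200x" and "250x" mean the fold concentration of each stock relative to the final medium; it deliberately ignores the extra dilution that a literal 1:1:1 mixing of already diluted stocks would imply, and the function names are hypothetical.

```python
# Each entry: (component, stock concentration in mol/L, fold concentration of the stock).
# Values are taken from the text; the interpretation of "200x" as fold
# concentration relative to the final medium is an assumption.
STOCKS = [
    ("Ca(NO3)2", 0.5, 200),
    ("MgSO4", 0.1, 200),
    ("KNO3", 0.5, 200),
    ("KH2PO4", 0.1, 200),
    ("FeCl3", 0.010, 250),
    ("Na-EDTA", 0.010, 250),
]


def working_concentration_mM(stock_molar: float, fold: float) -> float:
    """Concentration in the 1x medium, in mmol/L."""
    return stock_molar * 1000.0 / fold


def stock_volume_mL(final_volume_mL: float, fold: float) -> float:
    """Volume of an N-fold stock needed for the given final volume."""
    return final_volume_mL / fold


for name, stock_molar, fold in STOCKS:
    print(f"{name}: {working_concentration_mM(stock_molar, fold):.3f} mM, "
          f"{stock_volume_mL(1000.0, fold):.1f} mL stock per litre")
```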

To obtain roots, 100 seeds are sterilized as described above, incubated for 4 days at
4°C and then grown in 250 mL flasks with MS medium (Sigma M5519) with addition of
a further 3% sucrose and 0.5 g/L MES (Sigma M8652), pH 5.7. The seedlings are
grown in a 16/8 hour photoperiod (Philips 58W/33 white-light lamp) at 22°C and
120 rpm and harvested after 3 weeks. For all the other plant organs used, the seeds
are sown in standard soil, incubated for 4 days at 4°C to ensure uniform germination
and then grown first under short-day conditions at 22°C, 9 h light (150 µE/m²s) and
60 to 65% relative atmospheric humidity, the temperature being lowered to 18°C during
the night. In order to stimulate the development of shoot and flower, the plants were
transferred into long-day conditions under a 16/8 hour photoperiod (OSRAM Lumilux
Daylight 36W/12 fluorescent tubes) at 22°C. Young rosette leaves are harvested in the
8-leaf stage (after 3 weeks), and stems and opened flowers are harvested at
development stage 14 (Smyth 1990) immediately after the stamens have developed. The
green pods which were used were 10 to 13 mm in length.

Example 2: RNA extraction and RT-PCR analysis 

Total RNA is isolated from the plant organs described in Example 1 at various points in
time of the development, following the RNA isolation protocol (Sambrook 1989) as
modified for Arabidopsis thaliana. The samples were comminuted finely in a pestle and
mortar with liquid N2, 1 mL of homogenization buffer was added (4 M guanidinium
thiocyanate, 0.1 M Tris-HCl pH 7.0, 10 mM EDTA, 0.5% sodium laurylsarcosine, 1% (v/v)
of β-mercaptoethanol), carefully disrupted further while defrosting and transferred into a
2 mL reaction vessel filled with 800 µL of phenol/chloroform/isoamyl alcohol (P/C/I)
(25:24:1 v/v), covered with a layer of DEPC (diethyl pyrocarbonate)-treated water. The
mixture was vortexed for 1 minute, centrifuged for 15 minutes at 4°C and 17,500 x g,
the aqueous phase was removed and re-extracted by shaking with 800 µL of P/C/I and
centrifuged (for 15 minutes at 4°C and 17,500 x g). To remove the phenol, the mixture
was extracted with 800 µL of chloroform/isoamyl alcohol (24:1 v/v). Better phase
separation was achieved by recentrifugation (see above). The supernatant was removed,
and the nucleic acids were precipitated for 1 hour at -20°C with the same volume of
isopropanol. The precipitate was sedimented at 4°C for 15 minutes at 17,500 x g,
washed with 3 M sodium acetate (pH 5.4), recentrifuged (for 10 minutes at 17,500 x g
and 4°C), and then washed 2 more times with ice-cold 70% ethanol. The pellet was
resuspended in 750 µL of TENS buffer (50 mM Tris-HCl pH 8.0, 10 mM EDTA, 100 mM
NaCl, 2% SDS (w/v), 3 mg/mL diethyl thiocyanate), extracted with 800 µL of P/C/I and
extracted by shaking with 800 µL of chloroform/isoamyl alcohol (see above). 5 M LiCl
was added to the aqueous phase in a ratio of 1:1, the RNA was precipitated overnight
at 4°C and then removed by centrifugation for 30 minutes at 4°C and 17,500 x g.
Thereupon, the pellet was washed twice with 70% strength ethanol, dried at 50°C in a
heating block and resuspended in 40 µL of H2O. All of the solutions were made with
triple-distilled H2O which had previously been treated with diethyl pyrocarbonate
(DEPC) and subsequently autoclaved.

The reverse transcriptase polymerase chain reaction (RT-PCR) is used to detect the
ptxA or SbHRGP3 gene transcript. The first-strand cDNA synthesis is carried out
starting with 6 µg of total RNA with an oligo (dT) primer and RT Superscript™ II enzyme
(200 units) following the manufacturer's instructions in a total volume of 20 µL (Life
Technologies, Gaithersburg, MD; Cat. No. 18064-022). For the RNA, 500 ng of oligo
(dT) primer is added in a final volume of 12 µL. The mixture is heated for 10 minutes at
70°C and subsequently immediately cooled on ice. Then, 4 µL of the 5x first-strand
buffer [250 mM Tris-HCl (pH 8.3 at room temperature), 375 mM KCl, 15 mM MgCl2],
2 µL of 0.1 M DTT and 1 µL of 10 mM dNTP mix (in each case 10 mM dATP, dCTP,
dGTP and dTTP at neutral pH) are added. The mixture is heated for 2 minutes at 42°C,
RT Superscript™ II enzyme (1 µL (200 units), Life Technologies) is added, and the
mixture is incubated for 50 minutes at 42°C. The oligo (dT) primer used is an
oligonucleotide with 17 dT residues.

Approximately 2 µL of the first-strand cDNA synthesis are employed for the PCR
reaction. The following are combined in a total volume of 50 µL, following the
manufacturer's instructions (Life Technologies):

5 µL of 10x PCR buffer [200 mM Tris-HCl (pH 8.4), 500 mM KCl]
1.5 µL of 50 mM MgCl2
1 µL of 10 mM dNTP mix (in each case 10 mM dATP, dCTP, dGTP and dTTP)
1 µL amplification primer 1 (10 mM)
1 µL amplification primer 2 (10 mM)
0.4 µL Taq DNA polymerase (5 U/µL)
2 µL cDNA (from the first-strand cDNA synthesis)
38.1 µL of autoclaved distilled water
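
The pipetting scheme above can be checked and scaled with a short calculation. The sketch below confirms that the listed volumes add up to the stated 50 µL and computes a master mix for several reactions; the 10% pipetting overage and the choice to leave the cDNA out of the master mix are assumptions made for illustration, not part of the described protocol.

```python
# Per-reaction volumes in µL, as listed in the text.
PCR_MIX_UL = {
    "10x PCR buffer": 5.0,
    "50 mM MgCl2": 1.5,
    "10 mM dNTP mix": 1.0,
    "amplification primer 1": 1.0,
    "amplification primer 2": 1.0,
    "Taq DNA polymerase (5 U/µL)": 0.4,
    "cDNA": 2.0,
    "autoclaved distilled water": 38.1,
}


def total_volume(mix: dict) -> float:
    """Sum of all component volumes in µL."""
    return sum(mix.values())


def master_mix(mix: dict, reactions: int, overage: float = 0.1,
               exclude: tuple = ("cDNA",)) -> dict:
    """Scale the per-reaction volumes; the template is added per tube.

    The default 10% overage is a hypothetical allowance for pipetting loss.
    """
    factor = reactions * (1.0 + overage)
    return {name: round(volume * factor, 2)
            for name, volume in mix.items() if name not in exclude}


print(f"total per reaction: {total_volume(PCR_MIX_UL):.1f} µL")
for name, volume in master_mix(PCR_MIX_UL, reactions=8).items():
    print(f"{name}: {volume} µL")
```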

The reaction mixture is covered with a layer of approx. 50 µL of silicone oil and
subjected to the following temperature program (Thermocycler: MWG Biotech Primus HT;
MWG Biotech, Germany):

1 cycle of 180 sec at 95°C
30 cycles of 40 sec at 95°C, 60 sec at 53°C and 2 min at 72°C
1 cycle of 5 minutes at 72°C
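
For planning purposes, the hold time of this temperature program can be totalled as in the minimal sketch below. It counts only the programmed hold times; ramping between temperatures depends on the instrument and is not included, so the real run takes longer.

```python
def program_seconds(initial_s: int, cycles: int, steps_s: tuple, final_s: int) -> int:
    """Total programmed hold time of a three-stage PCR program, in seconds."""
    return initial_s + cycles * sum(steps_s) + final_s


# Values from the program above: 180 s at 95°C, then 30 cycles of
# 40 s / 60 s / 120 s, then 300 s at 72°C.
total_s = program_seconds(initial_s=180, cycles=30, steps_s=(40, 60, 120), final_s=300)
print(f"{total_s} s, i.e. {total_s / 60:.0f} min of hold time")
```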

The presence of the ptxA or SbHRGP3 mRNA in a sample is then detected
electrophoretically by separating the reaction mixture on a 1% agarose gel and staining,
for example with ethidium bromide.

The amplification primers employed ("forward" and "reverse" primers) are the following
oligonucleotides:

Forward primer (ptxA5'): 5'-GGCGCGCCCGCAATTTTTTGTGAAGC-3' (SEQ ID NO: 3)

Reverse primer-1 (ptxA3'): 5'-TCTAGATAAGTTTCGAAGATTTTAG-3' (SEQ ID NO: 4)

For amplification of the ptxA promoter described by base 1 to 828 of SEQ ID NO: 1, the
following reverse primer is used instead of primer ptxA3':

Reverse primer-2 (ptxA3'-2): 5'-TCTAGATAAACTATGAAGCTTTG-3' (SEQ ID NO: 11)

Further primers can be derived from the known cDNA sequence of the GUS gene.

Example 3: Cloning of the ptxA or SbHRGP3 promoter

Genomic DNA from pea and soybean is extracted using the Qiagen DNeasy Plant
Mini Kit (Qiagen). The ptxA promoter region (882 bp) was isolated from genomic DNA
of pea (Pisum sativum) using conventional PCR. Approximately 0.1 µg of digested
genomic DNA was used for the regular PCR reaction (see below). The primers were
designed based on the pea ptxA sequence disclosed by Bown (GenBank accession
number X67427.1). One µL of the diluted digested genomic DNA was used as the DNA
template in the primary PCR reaction. The reaction comprised primers primer 1 (SEQ
ID NO: 1) and primer 2 (SEQ ID NO: 2) in a mixture containing Buffer 3 following the
protocol outlined by an Expand Long PCR kit (Cat #1681-842, Roche-Boehringer
Mannheim). The isolated DNA is employed as template DNA in a PCR amplification
reaction using the following primers:

Forward primer (SbHRGP3 5'): 5'-TCTAGATAGAAGCTTTTCAACAATCATGC-3' (SEQ ID NO: 5)

Reverse primer (SbHRGP3 3'): 5'-AGATCTTACTGCCATTAGGAGAGG-3' (SEQ ID NO: 6)

Amplification is carried out as follows:

1x PCR reaction buffer (Roche Diagnostics)
5 µL genomic DNA (corresponds to approximately 80 ng)
2.5 mM of each dATP, dCTP, dGTP and dTTP (Invitrogen: dNTP mix)
1 µL primer SbHRGP3 5' (SEQ ID NO: 5) 330 mg/mL
1 µL primer SbHRGP3 3' (SEQ ID NO: 6) 230 mg/mL
1 µL Taq DNA polymerase 5 U/µL (Roche Diagnostics),
in a final volume of 100 µL.

The following temperature program is used (Thermocycler: T3 Thermocycler, Biometra):

1 cycle with 180 sec at 95°C
30 cycles with 40 sec at 95°C, 60 sec at 53°C and 2 min at 72°C
1 cycle with 5 min at 72°C

The PCR product is applied to a 1% (w/v) agarose gel and separated at 80 V. Fragments
of approximately 882 base pairs in length are excised from the gel and purified
with the aid of the Qiagen Gel Extraction Kit (Qiagen, Hilden, Germany). If appropriate,
the eluate of 50 µL can be evaporated. The purified DNA is digested as follows for 2
hours at 37°C:

19 µL purified PCR-DNA
1 µL AscI restriction enzyme (10 U, Roche Diagnostics)
1 µL XbaI restriction enzyme (10 U, Roche Diagnostics)
10 µL buffer B (Roche Diagnostics)
69 µL distilled water

This is followed by purification via the PCR Purification Kit (Roche Diagnostics). The
cut and purified DNA fragment is inserted into the AscI and XbaI cleavage sites of the
Bluescript plasmid (Stratagene). Ligation of the vectors, transformation into E. coli
cells and analysis of the plasmids are carried out by standard methods (Sambrook
1989). The identity can be verified by sequencing the plasmid and comparison with the
genomic DNA sequence (GenBank Number X67427.1). The resulting construct is
pBPS-ptxA or pBPS-SbHRGP3 (Fig. 2, construct I).
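
The cloning strategy relies on restriction sites carried at the 5' ends of the primers. As a quick consistency check, the minimal sketch below looks for the recognition sequences of AscI (GGCGCGCC), XbaI (TCTAGA) and BglII (AGATCT) at the start of each primer listed under SEQ ID NO: 3 to 6 and 11. The dictionary and function names are hypothetical; the primer sequences are those given in this document.

```python
# Recognition sequences of the enzymes relevant to the primers in this document.
SITES = {"AscI": "GGCGCGCC", "XbaI": "TCTAGA", "BglII": "AGATCT"}

# Primer sequences (5' to 3') as listed under SEQ ID NO: 3, 4, 11, 5 and 6.
PRIMERS = {
    "ptxA5'": "GGCGCGCCCGCAATTTTTTGTGAAGC",
    "ptxA3'": "TCTAGATAAGTTTCGAAGATTTTAG",
    "ptxA3'-2": "TCTAGATAAACTATGAAGCTTTG",
    "SbHRGP3 5'": "TCTAGATAGAAGCTTTTCAACAATCATGC",
    "SbHRGP3 3'": "AGATCTTACTGCCATTAGGAGAGG",
}


def five_prime_sites(primer: str) -> list:
    """Return the enzymes whose recognition sequence starts the primer."""
    sequence = primer.upper()
    return [enzyme for enzyme, site in SITES.items() if sequence.startswith(site)]


for name, sequence in PRIMERS.items():
    print(name, five_prime_sites(sequence))
```

The ptxA primer pair carries AscI and XbaI sites, matching the digest described above, while the SbHRGP3 pair carries XbaI and BglII sites.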

As an alternative, the PCR product can be cloned directly into vector pCR4-TOPO
(Invitrogen) following the manufacturer's instructions, i.e. the PCR product obtained,
with its A overhangs, is inserted into a vector having T overhangs by means of a
topoisomerase.

EXAMPLE 4: Construction of ptxA or SbHRGP3 promoter containing transformation
vectors

The ptxA promoter fragment in the TOPO vector (Invitrogen) is digested with AscI and
XbaI at 37°C for 2 h or at 4°C overnight. The promoter fragment is purified from the gel
(Qiagen kit) after electrophoresis and cloned upstream of the GUS reporter gene in
pUC using the Rapid Ligation kit (Roche). The ligation solution is transformed into
E. coli DH5α cells (Stratagene). The GUS chimeric constructs in pUC are digested with
AscI and PmeI and cloned into a binary vector. SbHRGP3 is cloned into the XbaI and
BglII sites of a binary vector to generate the GUS chimeric construct.

GUS chimeric constructs for monocotyledonous plant transformation are made by adding
an intron of interest in the 5' untranslated region, which is located downstream of the
promoter and upstream of the reporter gene. The intron of interest is amplified
with primers containing a PacI site at the 5' terminus and an SbfI or XmaI site at the 3'
terminus overhang. The PCR fragment is digested with PacI and SbfI or XmaI and cloned
into the 5' untranslated region of a binary vector.

EXAMPLE 5: Agrobacterium-mediated transformation in dicotyledonous and
monocotyledonous plants

5.1: Transformation and regeneration of transgenic Arabidopsis thaliana
(Columbia) plants

To generate transgenic Arabidopsis plants, Agrobacterium tumefaciens (strain C58C1
pGV2260) is transformed with various ptxA or SbHRGP3 promoter/GUS vector
constructs. The agrobacterial strains are subsequently used to generate transgenic plants.
To this end, a single transformed Agrobacterium colony is incubated overnight at 28°C
in a 4 mL culture (medium: YEB medium with 50 µg/mL kanamycin and 25 µg/mL
rifampicin). This culture is subsequently used to inoculate a 400 mL culture in the same
medium, and this is incubated overnight (28°C, 220 rpm) and spun down (GSA rotor,
8,000 rpm, 20 min). The pellet is resuspended in infiltration medium (1/2 MS medium;
0.5 g/L MES, pH 5.8; 50 g/L sucrose). The suspension is introduced into a plant box
(Duchefa), and 100 mL of Silwet L-77 (heptamethyltrisiloxane modified with
polyalkylene oxide; Osi Specialties Inc., Cat. P030196) is added to a final concentration of
0.02%. In a desiccator, the plant box with 8 to 12 plants is exposed to a vacuum for 10
to 15 minutes, followed by spontaneous aeration. This is repeated two or three times.
Thereupon, all plants are planted into flowerpots with moist soil and grown under
long-day conditions (daytime temperature 22 to 24°C, nighttime temperature 19°C; relative
atmospheric humidity 65%). The seeds are harvested after 6 weeks.

As an alternative, transgenic Arabidopsis plants can be obtained by root
transformation. White root shoots of plants with a maximum age of 8 weeks are used. To
this end, plants which are kept under sterile conditions in 1x MS medium (1% sucrose;
100 mg/L inositol; 1.0 mg/L thiamine; 0.5 mg/L pyridoxine; 0.5 mg/L nicotinic acid;
0.5 g MES, pH 5.7; 0.8% agar) are used. Roots are grown on callus-inducing medium for
3 days (1x Gamborg's B5 medium; 2% glucose; 0.5 g/L mercaptoethanol; 0.8% agar;
0.5 mg/L 2,4-D (2,4-dichlorophenoxyacetic acid); 0.05 mg/L kinetin). Root sections
0.5 cm in length are transferred into 10 to 20 mL of liquid callus-inducing medium
(composition as described above, but without agar supplementation), inoculated with
1 mL of the above-described overnight agrobacterial culture (grown at 28°C, 200 rpm in
LB) and shaken for 2 minutes. After excess medium has been allowed to run off, the root
explants are transferred to callus-inducing medium with agar, subsequently to
callus-inducing liquid medium without agar (with 500 mg/L betabactyl, SmithKline Beecham
Pharma GmbH, Munich), incubated with shaking and finally transferred to
shoot-inducing medium (5 mg/L 2-isopentenyladenine phosphate; 0.15 mg/L indole-3-acetic
acid; 50 mg/L kanamycin; 500 mg/L betabactyl). After 5 weeks, and after 1 or 2
medium changes, the small green shoots are transferred to germination medium (1x MS
medium; 1% sucrose; 100 mg/L inositol; 1.0 mg/L thiamine; 0.5 mg/L pyridoxine;
0.5 mg/L nicotinic acid; 0.5 g MES, pH 5.7; 0.8% agar) and regenerated into plants.

5.2: Transformation and regeneration of crop plants

Agrobacterium-mediated plant transformation using standard transformation and
regeneration techniques may also be carried out for the purposes of transforming crop
plants (Gelvin 1995; Glick 1993).

For example, oilseed rape can be transformed by cotyledon or hypocotyl
transformation (Moloney 1989; De Block 1989). The use of antibiotics for the selection of
Agrobacteria and plants depends on the binary vector and the Agrobacterium strain used
for the transformation. The selection of oilseed rape is generally carried out using
kanamycin as selectable plant marker.

The Agrobacterium-mediated gene transfer in linseed (Linum usitatissimum) can be
carried out using, for example, a technique described by Mlynarova (1994).

The transformation of soya can be carried out using, for example, a technique
described in EP-A1 0424 047 or in EP-A1 0397 687, US 5,376,543, US 5,169,770.

The transformation of maize or other monocotyledonous plants can be carried out
using, for example, a technique described in US 5,591,616.

The transformation of plants using particle bombardment, polyethylene glycol-mediated
DNA uptake or via the silicon carbonate fiber technique is described, for example, by
Freeling & Walbot (1993; "The maize handbook", ISBN 3-540-97826-7, Springer Verlag
New York).

Example 6: Detection of the tissue-specific expression

To identify the characteristics of the promoter and the essential elements of the latter
which bring about its tissue specificity, it is necessary to place the promoter itself and
various fragments thereof before what is known as a reporter gene, which allows the
determination of the expression activity. An example which may be mentioned is the
bacterial β-glucuronidase (Jefferson 1987a). The β-glucuronidase activity can be
detected in planta by means of a chromogenic substrate such as
5-bromo-4-chloro-3-indolyl-β-D-glucuronic acid in an activity staining (Jefferson 1987b).
To study the tissue specificity, the plant tissue is cut, embedded, stained and analyzed
as described (for example Baumlein 1991b).

A second assay permits the quantitative determination of the GUS activity in the tissue
studied. For the quantitative activity determination, MUG
(4-methylumbelliferyl-β-D-glucuronide) is used as substrate for β-glucuronidase, and the
MUG is cleaved into MU (methylumbelliferone) and glucuronic acid.

To do this, a protein extract of the desired tissue is first prepared and the substrate of
GUS is then added to the extract. The substrate can be measured fluorimetrically only
after the GUS has been reacted. Samples which are subsequently measured in a
fluorimeter are taken at various points in time. This assay may be carried out for
example with linseed embryos at various developmental stages (21, 24 or 30 days after
flowering). To this end, in each case one embryo is ground into a powder in a 2 mL
reaction vessel in liquid nitrogen with the aid of a vibration grinding mill (Type: Retsch
MM 2000). After addition of 100 µL of EGL buffer, the mixture is centrifuged for 10
minutes at 25°C and 14,000 x g. The supernatant is removed and recentrifuged. Again,
the supernatant is transferred to a new reaction vessel and kept on ice until further use.
25 µL of this protein extract are treated with 65 µL of EGL buffer (without DTT) and
employed in the GUS assay. 10 µL of the substrate MUG (10 mM
4-methylumbelliferyl-β-D-glucuronide) are now added, the mixture is vortexed, and 30 µL
are removed immediately as zero value and treated with 470 µL of Stop buffer (0.2 M
Na2CO3). This procedure is repeated for all of the samples at an interval of 30 seconds.
The samples taken were stored in the refrigerator until measured. Further readings were
taken after 1 h and after 2 h. A calibration series which contained concentrations from
0.1 mM to 10 mM MU (4-methylumbelliferone) was established for the fluorimetric
measurement. If the sample values were outside these concentrations, less protein
extract was employed (10 µL, 1 µL, 1 µL from a 1:10 dilution), and shorter intervals were
measured (0 h, 30 min, 1 h). The measurement was carried out at an excitation of
365 nm and an emission of 445 nm in a Fluoroscan II apparatus (Labsystem). As an
alternative, the substrate cleavage can be monitored fluorimetrically under alkaline
conditions (excitation at 365 nm, measurement of the emission at 455 nm; Spectro
Fluorimeter BMG Polarstar+) as described in Bustos (1989). All the samples were
subjected to a protein concentration determination by the method of Bradford (1976), thus
allowing an identification of the promoter activity and promoter strength in various
tissues and plants.
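
To illustrate how the time-course readings and the Bradford value combine into a promoter strength figure, the following minimal sketch fits a straight line to the MU released over time and normalizes the slope to the protein amount. The function names, the units (pmol MU, minutes, mg protein) and the example numbers are assumptions made for illustration; they are not data from this document.

```python
def slope(xs: list, ys: list) -> float:
    """Least-squares slope of ys against xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    numerator = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    denominator = sum((x - mean_x) ** 2 for x in xs)
    return numerator / denominator


def gus_specific_activity(times_min: list, mu_pmol: list, protein_mg: float) -> float:
    """GUS activity in pmol MU per minute per mg protein.

    mu_pmol holds the MU amounts read off the calibration series for the
    samples taken at times_min; protein_mg is the Bradford result for the
    extract volume used in the assay.
    """
    return slope(times_min, mu_pmol) / protein_mg


# Hypothetical readings at 0 h, 1 h and 2 h, for illustration only.
activity = gus_specific_activity([0.0, 60.0, 120.0], [0.0, 300.0, 600.0], 0.05)
print(f"{activity:.1f} pmol MU/min/mg protein")
```

Normalizing to protein is what makes values from different tissues and plants comparable, which is the purpose of the Bradford determination mentioned above.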

EGL buffer: 0.1 M KPO4, pH 7.8; 1 mM EDTA; 5% glycerol; 1 M DTT.

EXAMPLE 7: Analysis of ptxA and SbHRGP3 promoter expression in Arabidopsis
and canola

In Arabidopsis, the ptxA promoter shows strong constitutive and ubiquitous expression in
most tissues and organs at different developmental stages and very low or no
expression (by GUS staining) in seeds (Figure 3A-3G). Strong ubiquitous expression can be
detected in young seedlings. The GUS expression levels are low in the organs at the
reproductive stages (siliques and flowers). No GUS histochemical stain is detected in
seeds (Table 1). In canola, the expression patterns are very similar to those in
Arabidopsis (Figure 4A-4G, Table 1).

Table 1. GUS expression controlled by ptxA promoter in Arabidopsis and canola

Plant        Seedlings  Leaves at early  Roots at early  Flowers  Siliques or  Seeds
species                 reproductive     reproductive             seedpods
                        stages           stages
Arabidopsis  +++++      ++               ++              ++       ++
Canola       +++++      ++++             N/A             ++*      -/+

* no expression in petals, medium levels of expression in sepals. GUS expression
levels were measured by histochemical assay on a scale from - to +++++.


Expression profiles of the ptxA homologue in soybean showed no or very low expression
in abiotic stressed roots, leaves, shoots, and rosettes under normal conditions, high
expression in stems and roots (normal and infected) and flowers, and strong expression
in calli.

These expression patterns found in Arabidopsis and canola are entirely different from
those which would be expected from the expression patterns controlled by the MsPRP2
promoter, given that the nucleotide sequence of the ptxA promoter is highly homologous
to that of the MsPRP2 promoter. The MsPRP2 promoter is reported as a salt-inducible and
highly root-specific promoter (Bastola 1998; WO 99/53016). These results indicate that
sequence similarity and expression patterns are not correlated.

In Arabidopsis, the SbHRGP3 promoter shows almost identical expression patterns but
lower expression levels in general compared to the ptxA promoter (Figure 5A-5D).

EXAMPLE 8: Assessment of expression patterns by real time RT-PCR analysis

Total RNA is extracted from plant tissues using the Qiagen RNeasy Plant Mini Kit (Cat.
No. 74904). Quality and quantity of the RNA are determined using the Molecular Probes
RiboGreen Kit (Cat. No. R-11490) on the Spectra MAX Gemini. One µg of RNA is used
for RT-PCR (Roche RT-PCR AMV kit, Cat. No. 1483188) in the reaction solution I
under the optimized PCR program described below.

Reaction solution I:

1 µg RNA
2 µL 10x Buffer
4 µL 25 mM MgCl2
2 µL 1 mM dNTPs
2 µL 3.2 µg Random Primers
1 µL 50 units RNase Inhibitor
0.8 µL 20 units AMV-RT polymerase
Fill to 20 µL with sterile water

PCR Program

1) 25°C 10 minutes
2) 42°C 1 hour
3) 99°C 5 minutes
4) 4°C Stop reaction

The RT-PCR sample is used for the LightCycler reaction (Roche: LightCycler FastStart
DNA Master SYBR Green I, Cat. No. 3003230).

LightCycler reaction

11.6 µL sterile water
2.4 µL 25 mM MgCl2
2 µL SYBR Green Polymerase mix
2 µL 10 mM Specific Primer Mix
2 µL RT-PCR reaction product

LightCycler Program

1) 95°C 5 minutes
2) 95°C 30 seconds
3) 61°C 40 seconds
4) 72°C 40 seconds - Repeat steps 2-4 for 30 cycles
5) 72°C 10 minutes
6) 4°C Stop reaction

Standardizing the concentration of RNA (1 µg) in each of the RT-PCR reactions is
sufficient to directly compare samples if the same primers are used for each LightCycler
reaction. The output is a number that corresponds to the PCR cycle at which the sample
reaches the inflection point in the log curve generated. The lower the cycle number,
the higher the concentration of target RNA present in the sample. Each sample is
run in triplicate and an average is generated to produce the sample "crosspoint"
value. The lower the crosspoint, the more strongly the target gene is expressed
in that sample. (For the detailed procedure see Roche Molecular Biochemicals
LightCycler System: Reference Guide, May 1999 version.)
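
The crosspoint comparison can be sketched as follows. The triplicate averaging follows the text; the conversion of a crosspoint difference into an approximate fold difference assumes perfect doubling of product per cycle (amplification efficiency of 2), which is an idealization and not a statement from the reference guide. Sample names and values are hypothetical.

```python
from statistics import mean


def mean_crosspoint(replicates: list) -> float:
    """Average crosspoint of the replicate reactions of one sample."""
    return mean(replicates)


def fold_difference(cp_a: float, cp_b: float, efficiency: float = 2.0) -> float:
    """Approximate target abundance in sample A relative to sample B.

    Assumes the same primers, equal RNA input and a constant per-cycle
    amplification factor (2.0 means perfect doubling).
    """
    return efficiency ** (cp_b - cp_a)


# Hypothetical triplicate crosspoints, for illustration only.
leaf_cp = mean_crosspoint([21.5, 22.0, 22.5])
seed_cp = mean_crosspoint([26.5, 27.0, 27.5])
print(f"leaf {leaf_cp:.1f}, seed {seed_cp:.1f}, "
      f"leaf/seed ratio about {fold_difference(leaf_cp, seed_cp):.0f}-fold")
```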

EXAMPLE 9: Utilization of transgenic crops

The ptxA or SbHRGP3 promoter may be employed either to express transgenes in a target
plant or to suppress expression of endogenous genes (e.g., by antisense or
double-stranded RNA; see above), thereby improving - for example - biomass and/or yield or
tolerance to biotic and abiotic environmental stresses. The chimeric constructs are
transformed into dicotyledonous and monocotyledonous plants. Standard methods for
transformation in the art can be used if required. Transformed plants are regenerated
using known methods. Various phenotypes are measured to determine improvement of
biomass, yield, fatty acid composition, high oil, disease tolerance, or any other
phenotypes that are linked to yield enhancement or stability. Gene expression levels are
determined at different stages of development and at different generations (T0 to T2
plants or further generations). The results of the evaluation in plants are used to
determine appropriate genes to combine with this promoter to increase yield.

EXAMPLE 10: Expression of selectable marker gene in dicotyledonous plants

A chimeric construct composed of the ptxA or SbHRGP3 promoter and a selectable marker
gene can be transformed into dicotyledonous plants such as Arabidopsis, soybean or
canola, but is not restricted to these plant species. Standard methods for
transformation in the art can be used if required. Transformed plants are selected under the
selection agent of interest and regenerated using known methods. The selection scheme is
examined at early developmental stages of tissues or tissue culture cells. Gene
expression levels can be determined at different stages of development and at different
generations (T0 to T2 plants or further generations). The results of the evaluation in
plants are used to determine appropriate genes to combine with this promoter.

EXAMPLE 11: Expression of selectable marker gene in monocotyledonous
plants

A chimeric construct composed of the ptxA or SbHRGP3 promoter and a selectable marker
gene can be transformed into monocotyledonous plants such as rice, barley, maize,
wheat, or ryegrass, but is not restricted to these plant species. Any methods for
improving expression in monocotyledonous plants are applicable, such as addition of an
intron or an exon with intron in the 5' UTR, either non-spliced or spliced. Standard
methods for transformation in the art can be used if required. Transformed plants are
selected under the selection agent of interest and regenerated using known methods. The
selection scheme is examined at early developmental stages of tissues or tissue culture
cells. Gene expression levels can be determined at different stages of development and
at different generations (T0 to T2 plants or further generations). The results of the
evaluation in plants are used to determine appropriate genes to combine with this
promoter.

Example 12: Deletion analysis

The cloning method is described by Rouster (1997) and Sambrook (1989). Detailed
mapping of the ptxA or SbHRGP3 promoter (i.e., narrowing down the nucleic acid
segments relevant for its specificity) is performed by generating various reporter gene
expression vectors that contain either the entire promoter region or various fragments
thereof. The entire promoter region or fragments thereof are cloned into a binary vector
containing GUS or another reporter gene. Two kinds of fragments are employed: first,
fragments obtained by cutting with restriction enzymes at internal restriction cleavage
sites in the full-length promoter sequence; second, PCR fragments provided with cleavage
sites introduced by the primers. The chimeric GUS constructs containing the various
deleted promoters are transformed into Arabidopsis and other plant species using
transformation methods known in the art. Promoter activity is analyzed by GUS
histochemical assays or other appropriate methods in various tissues and organs at
different developmental stages.
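The planning of such a deletion series can be illustrated in silico. The following is a minimal sketch only, assuming evenly spaced 5' truncations that all keep the same 3' end; the function name, the step size, the minimum length and the placeholder sequence are hypothetical and are not taken from the examples above.

```python
def five_prime_deletions(promoter, step, min_length):
    """Return (start, fragment) pairs for a nested 5' deletion series.

    start is the 1-based position of the first retained base; every
    fragment keeps the original 3' end, so only upstream sequence is lost.
    """
    if step <= 0 or min_length <= 0:
        raise ValueError("step and min_length must be positive")
    fragments = []
    start = 0
    while len(promoter) - start >= min_length:
        fragments.append((start + 1, promoter[start:]))
        start += step
    return fragments


# Placeholder 100 bp sequence, NOT one of the promoters of the invention.
demo_promoter = "ACGT" * 25
series = five_prime_deletions(demo_promoter, step=30, min_length=20)
for start, fragment in series:
    print(f"fragment from bp {start}: {len(fragment)} bp")
```

In practice the truncation points would be dictated by the available restriction sites or by primer design rather than by a fixed step.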

Example 13: In vivo mutagenesis

The skilled worker is familiar with a variety of methods for the modification of the
promoter activity or the identification of important promoter elements. One of these
methods is based on random mutation followed by testing with reporter genes as described
above. The in vivo mutagenesis of microorganisms can be achieved by passage of the plasmid
(or of another vector) DNA through E. coli or other microorganisms (for example Bacillus
spp. or yeasts such as Saccharomyces cerevisiae) in which the ability to maintain the
integrity of the genetic information is disrupted. Conventional mutator strains have
mutations in the genes for the DNA repair system (for example mutHLS, mutD, mutT and the
like; for reference, see Rupp 1996). The skilled worker is familiar with these strains.
The use of these strains is illustrated for example by Greener (1994). The transfer of
mutated DNA molecules into plants is preferably effected after selection and testing of
the microorganisms. Transgenic plants are generated and analyzed as described above.
Example 14: PLACE Analysis for ptxA Promoter (SEQ ID NO: 1)

Based on the PLACE results given below, a potential TATA box is located at base pair 549
to base pair 554 of SEQ ID NO: 1. In consequence, the 5' untranslated region starts at
about base pair 584 and extends to base pair 863 of SEQ ID NO: 1. The sequence described
by SEQ ID NO: 1 ends just before the ATG start codon. Based on the promoter element
analysis, there seem to be no clusters of promoter elements in the first 300 base pairs
of the sequence described by SEQ ID NO: 1. It is therefore very likely that the core
region of the ptxA promoter extends from about base pair 300 to about base pair 583 of
the sequence described by SEQ ID NO: 1.


The following clusters of promoter elements were identified in the ptxA promoter as 
described by SEQ ID NO: 1: 

Motif Name                Location (Strand)                                       Motif Sequence

AMYBOX2                   537 (+)                                                 TATCCAT
C8GCARGAT                 571 (+/-)                                               CWWWWWWWWG
CAATBOX1                  368 (+); 439, 525 (-)                                   CAAT
CARGCW8GAT                571 (+/-)                                               CWWWWWWWWG
CCAATBOX1                 367 (+)                                                 CCAAT
DOFCOREZM                 334, 357, 382, 389, 400, 429 (+); 446, 517, 591 (-)     AAAG
EBOXBNNAPA                407, 409 (+); 407, 409 (-)                              CANNTG
GATABOX                   337 (+); 537 (-)                                        GATA
GT1CONSENSUS              424, 544 (+); 363, 518, 593 (-)                         GRWAAW
GTGANTG10                 406, 452, 479 (-)                                       GTGA
IBOX                      535 (-)                                                 GATAAG
IBOXCORE                  536 (-)                                                 GATAA
IBOXCORENT                534 (-)                                                 GATAAGR
MYBST1                    537 (-)                                                 GGATA
MYCATERD1                 409 (+); 407 (-)                                        CATGTG
MYCATRD22                 407 (+); 409 (-)                                        CACATG
MYCCONSENSUSAT            407, 409 (+); 407, 409 (-)                              CANNTG
POLASIG1                  550 (+)                                                 AATAAA
POLASIG2                  396 (+)                                                 AATTAAA
POLASIG3                  462 (+)                                                 AATAAT
POLLEN1LELAT52            359 (+); 595 (-)                                        AGAAA
PYRIMIDINEBOXOSRAMY1A     590 (+)                                                 CCTTTT
SEBFCONSSTPR10A           476 (+)                                                 YTGTCWC
SEF4MOTIFGM7S             301 (+)                                                 RTTTTTR
TAAAGSTKST1               388, 399 (+)                                            TAAAG
TATABOX5                  549 (-)                                                 TTATTT
TATCCAOSAMY               537 (+)                                                 TATCCA
TATCCAYMOTIFOSRAMY3D      537 (+)                                                 TATCCAY
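How motif hits of this kind can be located on both strands may be illustrated with a short script. This is a sketch under stated assumptions, not the PLACE software: it expands the IUPAC ambiguity codes used in the motif sequences into regular expressions and, as an assumed convention, reports every hit at its leftmost 1-based position on the given strand, marking reverse-complement matches with (-). The helper names and the toy sequences are hypothetical.

```python
import re

# IUPAC nucleotide ambiguity codes expanded to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "W": "[AT]", "S": "[CG]",
    "M": "[AC]", "K": "[GT]", "H": "[ACT]", "B": "[CGT]",
    "D": "[AGT]", "V": "[ACG]", "N": "[ACGT]",
}
# Complement of each code (e.g. R = A/G pairs with Y = C/T).
COMPLEMENT = str.maketrans("ACGTRYWSMKHBDVN", "TGCAYRWSKMDVHBN")


def reverse_complement(motif):
    """Reverse complement of a motif written in IUPAC codes."""
    return motif.translate(COMPLEMENT)[::-1]


def find_motif(sequence, motif):
    """Return sorted (position, strand) hits; positions are 1-based."""
    hits = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        # A lookahead keeps overlapping matches.
        regex = re.compile("(?=" + "".join(IUPAC[b] for b in pattern) + ")")
        hits.extend((m.start() + 1, strand) for m in regex.finditer(sequence))
    return sorted(hits)


toy = "GGCTTATCCATGG"  # toy sequence, not SEQ ID NO: 1
print(find_motif(toy, "TATCCAY"))          # plus-strand hit
print(find_motif(toy, "GATA"))             # hit on the reverse complement
print(find_motif("ACACATGTGC", "CANNTG"))  # palindromic motif, both strands
```

Under this convention a palindromic motif such as CANNTG is reported at the same positions on both strands.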



Example 15: PLACE Analysis for SbHRGP3 Promoter (SEQ ID NO: 2)

Based on the PLACE results given below, a potential TATA box is located at base pair 1147
to base pair 1152 of SEQ ID NO: 2. In consequence, the 5' untranslated region starts at
about base pair 1179 and extends to base pair 1380 of SEQ ID NO: 2. The sequence described
by SEQ ID NO: 2 ends 12 base pairs before the ATG start codon. Based on the promoter
element analysis, there seem to be no clusters of promoter elements in the first 800 base
pairs of the sequence described by SEQ ID NO: 2. It is therefore very likely that the core
region of the SbHRGP3 promoter extends from about base pair 800 to about base pair 1179 of
the sequence described by SEQ ID NO: 2. The following clusters of promoter elements were
identified in the SbHRGP3 promoter as described by SEQ ID NO: 2:



Motif Name                Location (Strand)                                                 Motif Sequence

-300ELEMENT               856 (+)                                                           TGHAAARK
AMYBOX1                   841 (-)                                                           TAACARA
ARFAT                     1166 (+)                                                          TGTCTC
BOXIINTPATPB              966 (+)                                                           ATAGAA
C8GCARGAT                 1014 (+/-)                                                        CWWWWWWWWG
CAATBOX1                  801, 1014, 1228, 1234 (+); 996, 1212, 1258, 1274 (-)              CAAT
CARGCW8GAT                1014 (+/-)                                                        CWWWWWWWWG
CCAATBOX1                 1212 (-)                                                          CCAAT
DOFCOREZM                 852, 859, 931, 1026, 1080, 1339, 1349 (+); 825, 951, 1189 (-)     AAAG
GARE1OSREP1               841 (-)                                                           TAACAGA
GATABOX                   868, 915, 1283, 1311, 1324 (+); 1172, 1231 (-)                    GATA
GT1CONSENSUS              1083, 1283, 1311, 1324, 1332 (+); 1104, 1131, 1149, 1238 (-)      GRWAAW
GTGANTG10                 855, 989 (+); 936 (-)                                             GTGA
IBOXCORE                  1283, 1311, 1324 (+)                                              GATAA
INRNTPSADB                852, 976 (-)                                                      YTCANTYY
MARTBOX                   1124 (+)                                                          TTWTWTTWTT
MYB1LEPR                  1119 (+)                                                          GTTAGTT
MYBCORE                   842 (+)                                                           CNGTTR
MYBPLANT                  1301 (+)                                                          MACCWAMC
MYBPZM                    1303 (+)                                                          CCWACC
MYBST1                    1323 (+)                                                          GGATA
PALBOXPPC                 1190 (+)                                                          YTYYMMCMAMCMMC
POLASIG1                  1049, 1128 (-)                                                    AATAAA
POLASIG2                  1054 (-)                                                          AATTAAA
POLASIG3                  1015 (+); 1146 (-)                                                AATAAT
POLLEN1LELAT52            1082 (+); 1133 (-)                                                AGAAA
PYRIMIDINEBOXOSRAMY1A     930 (-)                                                           CCTTTT
QELEMENTZMZM13            933 (+)                                                           AGGTCA
RAV1AAT                   1100, 1355 (+)                                                    CAACA
RBCSCONSENSUS             1177 (+)                                                          AATCCAA
REALPHALGLHCB21           1197 (+)                                                          AACCAA
ROOTMOTIFTAPOX1           540, 811, 1046, 1236 (+); 802, 1229, 12135 (-)                    ATATT
RYREPEATBNNAPA            940 (+)                                                           CATGCA
RYREPEATGMGY2             940 (+)                                                           CATGCAT
RYREPEATLEGUMINBOX        940 (+)                                                           CATGCAY
SEBFCONSSTPR10A           1165 (+); 989 (-)                                                 YTGTCWC
SEF1MOTIF                 1046 (+)                                                          ATATTTAWW
SV40COREENHAN             1189 (-)                                                          GTGGWWHG
TAAAGSTKST1               1079, 1348 (+); 951 (-)                                           TAAAG
TATABOX4                  1042 (-)                                                          TATATAA
TATABOX5                  1050, 1124, 1129, 1147 (+); 1085 (-)                              TTATTT
TATAPVTRNALEU             1041 (+)                                                          TTTATATA
TATCCAOSAMY               1322 (-)                                                          TATCCA
TGTCACACMCUCUMISIN        988 (-)                                                           TGTCACA
TRANSINITDICOTS           889 (-)                                                           AMNAUGGC
TRANSINITMONOCOTS         889 (-)                                                           RMNAUGGC
WBOXATNPR1                1021 (+); 1098 (-)                                                TTGAC
WUSATAg                   845 (+)                                                           TTAATGG
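The region boundaries stated in Examples 14 and 15 can be written down as 1-based, inclusive coordinates, which makes the fragment lengths easy to check. The sketch below is illustrative only: the class and variable names are hypothetical, the boundaries are the approximate ("about") values given in the text, and a real sequence from the sequence listing would have to be supplied to use `extract`.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    """A promoter sub-region given in 1-based, inclusive coordinates."""
    name: str
    start: int
    end: int

    @property
    def length(self):
        return self.end - self.start + 1

    def extract(self, sequence):
        """Slice the region out of a sequence string."""
        return sequence[self.start - 1:self.end]


# Approximate boundaries as stated for SEQ ID NO: 1 (ptxA).
PTXA_REGIONS = [Region("core promoter", 300, 583), Region("5' UTR", 584, 863)]
# Approximate boundaries as stated for SEQ ID NO: 2 (SbHRGP3).
SBHRGP3_REGIONS = [Region("core promoter", 800, 1179), Region("5' UTR", 1179, 1380)]

for label, regions in (("ptxA", PTXA_REGIONS), ("SbHRGP3", SBHRGP3_REGIONS)):
    for region in regions:
        print(f"{label} {region.name}: bp {region.start}-{region.end}, {region.length} bp")
```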
EXAMPLE 16: Analysis of ptxA in T3 Arabidopsis

Based on GUS histochemical assays, T3 Arabidopsis lines containing a ptxA::GUS
chimeric construct show strong expression in vegetative tissues and organs, sporadic
and low expression in flowers, low to medium expression in siliques and the funiculus
(the stalk of a seed), and no expression in seeds (4, 8, and 14 days after flowering;
DAF). These expression patterns are very similar to those in the T2 generation. T2 lines
show low to no expression, or low expression in restricted regions of the flowers. In T3,
young flowers (4 DAF) show more expression than older flowers (8 and 14 DAF). In addition,
the high copy lines (e.g. D54) show more expression in flowers than single copy lines.
In T2 and T3, however, no GUS stain is detected in seeds at various developmental
stages (4, 8, and 14 DAF).

For the tissues in the vegetative stages, GUS expression is measured at the mRNA
level using real-time RT-PCR (Table 2). The real-time RT-PCR results indicate that the
ptxA promoter controls medium to strong expression in most tissues in the vegetative
stages. The high copy lines (e.g. D54) show stronger expression than low copy lines
(Table 2), a difference that is not easily distinguished by the GUS histochemical assays,
since the expression levels in the vegetative tissues are already high. These data support
the GUS histochemical assays with respect to the effect of gene dosage found in flowers.
Quantification of GUS expression in seeds alone is not feasible, since the siliques and
the region connecting seed and silique have a medium level of expression, which can
easily contaminate the seed samples.

Table 2: GUS expression controlled by ptxA promoter in vegetative tissues at various
developmental stages of T3 Arabidopsis

Developmental stages       Crosspoints
                           D31 (1)*        D36 (1)         D52 (1)         D69 (1)         D54 (5)

Germination [4 DAG]        25.45±0.049     23.42±0.30      21.62±0.303     20.88±0.116     20.30±0.112
Leaves & stems [14 DAG]    25.633±0.071    23.79±0.123     21.23±0.102     21.4±0.095      21.33±0.107
Roots [14 DAG]             24.84±0.150     25.54±0.031     24.41±0.369     22.62±0.124     N/A
Leaves & stems [21 DAG]    24.84±0.128     26.49±0.039     24.2±0.110      23.87±0.965     21.77±0.327
Roots [21 DAG]             25.99±0.199     24.00±0.195     22.06±0.251     24.97±0.502     21.9±0.955
Rosette leaves             24.743±0.068    22.770±0.075    20.030±0.053    20.85±0.095     21.97±0.651
Stem leaves                24.16±0.105     23.045±0.186    21.17±0.443     21.40±0.199     19.92±0.251



Quantitative PCR (qPCR) experiments detected increased expression levels of the
GUS gene from reverse-transcribed mRNA isolated from the tissues of the transgenic
Arabidopsis (T3). Expression levels are represented as the crosspoint observed during
qPCR of each sample. The crosspoint represents the cycle at which PCR enters log-linear
amplification; it is inversely related to the logarithm of the amount of starting
template. Therefore, the lower the crosspoint, the higher the expression. Samples were
qPCR-amplified in triplicate. The GUS expression is normalized by the internal control
(mean ± standard deviation). * Five independent events (copy number)
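Because each amplification cycle ideally doubles the template, a difference in crosspoints can be translated into an approximate fold difference in starting template. The following sketch assumes perfect doubling per cycle (an amplification factor of 2), which is not stated in the example, and uses the germination-stage mean crosspoints from Table 2; the function name is hypothetical.

```python
def fold_difference(cp_reference, cp_sample, amplification_factor=2.0):
    """Approximate template ratio sample/reference from two crosspoints.

    Assumes the same amplification factor per cycle for both samples;
    a lower crosspoint means more starting template.
    """
    return amplification_factor ** (cp_reference - cp_sample)


# Mean crosspoints at germination [4 DAG], taken from Table 2.
germination = {"D31": 25.45, "D36": 23.42, "D52": 21.62, "D69": 20.88, "D54": 20.30}

for line, crosspoint in germination.items():
    ratio = fold_difference(germination["D31"], crosspoint)
    print(f"{line}: about {ratio:.1f}-fold relative to D31")
```

Under this assumption the 5.15-cycle gap between D31 and the high copy line D54 corresponds to roughly a 35-fold difference in template.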
EXAMPLE 17: Construction of ptxA promoter in combination with maize Ubiquitin
intron for monocot transformation

The PtxA-GUS construct in pUC is digested with PacI and XmaI. pBPSMM348 is digested
with PacI and XmaI to isolate the maize Ubiquitin intron (ZmUbi intron), followed by
electrophoresis and purification with the QIAEX II Gel Extraction Kit (cat# 20021). The
ZmUbi intron is ligated into the PtxA-GUS construct in pUC to generate a pUC-based
PtxA-ZmUbi intron-GUS construct, followed by restriction enzyme digestion with AfeI and
PmeI. The PtxA-ZmUbi intron-GUS cassette is cut out of a SeaPlaque low melting temperature
agarose gel (SeaPlaque® GTG® Agarose, catalog No. 50110) after electrophoresis. A
monocotyledonous base vector containing a selectable marker cassette (Monocot base vector)
is digested with PmeI. The GUS expression cassette containing the ptxA promoter-ZmUbi
intron is ligated into the Monocot base vector to generate the PtxA-ZmUbi intron-GUS
construct (Fig. 9).


The PtxA-ZmUbi intron-GUS construct is transformed into a recombinant LBA4404 strain
containing pSB1 (super vir plasmid) by electroporation following a general protocol
known in the art. Agrobacterium-mediated transformation of maize is performed using
immature embryos following a protocol described in US 5,591,616. An imidazolinone
herbicide selection is applied to obtain transgenic maize lines. GUS histochemical assays
are conducted with the following samples: immature embryos at 3 days after co-cultivation,
in vitro roots and leaves, and young transgenic plantlets (Table 3). This chimeric GUS
construct shows strong expression in in vitro tissues and young T0 plantlets. This result
indicates that a dicotyledonous promoter in combination with a monocotyledonous intron
can be functional in monocotyledonous plants.



Table 3: GUS expression controlled by ptxA promoter::ZmUbi intron in maize

Plant      Immature embryo                  Embryogenic   In vitro   In vitro   T0
species    [3 days after co-cultivation]    calli         roots      leaves     plantlets

Maize      *                                ++++          ++++       ++         +++

*no expression in petals, medium levels of expression in sepals; a range of GUS expression
levels measured by histochemical assay (- to +++++)
We claim:

1. A transgenic expression construct for predominant expression of a nucleic acid
sequence of interest in substantially all vegetative plant tissues comprising a
promoter sequence selected from the group consisting of

a) the promoter of the Pisum sativum ptxA gene, functional equivalent fragments
and functional equivalent homologs thereof, or their complements, having
essentially the same promoter activity as the promoter of the Pisum sativum ptxA
gene, and

b) the promoter of the Glycine max extensin (SbHRGP3) gene, functional equivalent
fragments and functional equivalent homologs thereof, or their complements,
having essentially the same promoter activity as the promoter of the Glycine max
extensin (SbHRGP3) gene,

wherein said promoter sequence is operably linked to a nucleic acid sequence of
interest to be transgenically expressed, and wherein said promoter sequence is
heterologous with respect to said nucleic acid sequence of interest.

2. The transgenic expression construct of Claim 1, wherein the promoter sequence is
selected from the group of sequences consisting of:

a) the promoter of the Pisum sativum ptxA gene as described by SEQ ID NO: 1, or
its complement,

b) a functional equivalent fragment of at least 50 consecutive base pairs of the
promoter sequence described by SEQ ID NO: 1, or its complement, having
essentially the same promoter activity as the promoter sequence described by
SEQ ID NO: 1,

c) a functional equivalent homolog of the promoter sequence described by SEQ ID
NO: 1 which has essentially the same promoter activity as the promoter sequence
described by SEQ ID NO: 1, and has

i) a homology of at least 95% over a sequence of at least 100 consecutive
base pairs to the sequence as described by SEQ ID NO: 1 and/or

ii) hybridizes under high stringency conditions with a fragment of at least 50
consecutive base pairs of the nucleic acid molecule described by SEQ ID
NO: 1.

3. The transgenic expression construct of Claim 2, wherein the functional equivalent
fragment comprises a sequence from about base pair 300 to about base pair 583
of the sequence described by SEQ ID NO: 1.
4. The transgenic expression construct of Claim 1, wherein the promoter sequence is
selected from the group of sequences consisting of:

a) the promoter of the Glycine max extensin (SbHRGP3) gene as described by
SEQ ID NO: 2, or its complement,

b) a functional equivalent fragment of at least 50 consecutive base pairs of the
promoter sequence described by SEQ ID NO: 2, or its complement, having
essentially the same promoter activity as the promoter sequence described by
SEQ ID NO: 2,

c) a functional equivalent homolog of the promoter sequence described by SEQ ID
NO: 2 which has essentially the same promoter activity as the promoter sequence
described by SEQ ID NO: 2, and has

i) a homology of at least 60% over a sequence of at least 100 consecutive
base pairs to the sequence as described by SEQ ID NO: 2 and/or

ii) hybridizes under high stringency conditions with a fragment of at least 50
consecutive base pairs of the nucleic acid molecule described by SEQ ID
NO: 2.

5. The transgenic expression construct of Claim 4, wherein the functional equivalent
fragment comprises a sequence from about base pair 800 to about base pair 1179
of the sequence described by SEQ ID NO: 2.



6. The transgenic expression construct of Claim 4, wherein the functional equivalent
homolog is described by a sequence selected from the group of sequences described
by SEQ ID NO: 7, 8, and 9.

7. The transgenic expression construct of any of Claim 1 to 6, wherein the expression
rate realized by the transgenic expression construct measured by a quantitative
β-glucuronidase assay and normalized to units of β-glucuronidase per gram of
biomass in seed and flower tissue is less than 10% of the corresponding value in
total vegetative plant tissue.

8. The transgenic expression construct of Claim 1 to 7, wherein

a) the nucleic acid sequence of interest to be expressed is linked operably to
further genetic control sequences, or

b) the expression construct comprises additional functional elements, or

c) both a) and b) apply.



9. The transgenic expression construct of Claim 1 to 8, wherein the nucleic acid 
sequence to be expressed transgenically results in, 

a) expression of a protein encoded by said nucleic acid sequence, and/or 

b) expression of sense, antisense, or double-stranded RNA encoded by said 
nucleic acid sequence. 
10. The transgenic expression construct of Claim 1 to 9, wherein expression occurs in
leaves, stems and roots but is not detectable in seeds.

11. A transgenic expression vector comprising a transgenic expression construct of
any of Claim 1 to 10.

12. A non-human transgenic organism transformed with an expression construct as
claimed in any of claims 1 to 10 or a vector as claimed in Claim 11.

13. The non-human transgenic organism of Claim 12, said organism selected from the
group consisting of bacteria, yeasts, fungi, animal and plant organisms.

14. The transgenic organism of Claim 13 selected from the group consisting of
sugarcane, maize, sorghum, pineapple, rice, barley, oat, wheat, rye, yam, onion,
banana, coconut, date, hop, rapeseed, tobacco, tomato, tagetes (marigold), soybean,
pea, common bean, and papaya.

15. A cell culture, part or transgenic propagation material derived from a transgenic
organism of Claim 12 to 14.

16. A method for transgenic predominant expression of a nucleic acid sequence of
interest in substantially all vegetative plant tissues comprising:

i. introduction of a transgenic expression construct into a plant cell or a plant, said
transgenic expression construct comprising a promoter sequence selected from
the group consisting of

a) the promoter of the Pisum sativum ptxA gene, functional equivalent fragments
and functional equivalent homologs thereof, or their complements,
having essentially the same promoter activity as the promoter of the Pisum
sativum ptxA gene, and

b) the promoter of the Glycine max extensin (SbHRGP3) gene, functional
equivalent fragments and functional equivalent homologs thereof, or their
complements, having essentially the same promoter activity as the promoter
of the Glycine max extensin (SbHRGP3) gene,

wherein said promoter sequence is operably linked to a nucleic acid sequence
of interest to be transgenically expressed, and wherein said promoter sequence
is heterologous with respect to said nucleic acid sequence of interest,
under conditions such that said nucleic acid sequence of interest is expressed
in said plant cell and/or predominantly expressed in the vegetative plant tissue
and/or organs of said transgenic plant.

17. The method of Claim 16, wherein expression occurs in leaves, stems and roots but
is not detectable in seeds.
18. The method of Claim 16 or 17, said method further comprising one or more of the
following steps

ii) identifying or selecting the transgenic plant cell comprising said transgenic
expression construct,

iii) regenerating transgenic plant tissue from the transgenic plant cell,

iv) regenerating a transgenic plant from the transgenic plant cell.

19. The method of any of Claim 16 to 18, wherein the transgenic expression construct 
is characterized as in Claim 1 to 10. 

20. The use of a transgenic organism as claimed in claim 12 to 14 or of cell cultures,
parts or transgenic propagation material derived therefrom as claimed in claim 15
for the production of foodstuffs, animal feeds, seeds, pharmaceuticals or fine
chemicals.

21. A method for production of a foodstuff, animal feed, seed, pharmaceutical or fine
chemical employing a transgenic organism as claimed in claim 12 to 14 or cell
cultures, parts or transgenic propagation material derived therefrom as claimed in
claim 15.
(156) YP-PPPAQQTCSIDALKLGACVDVLGGLIHIGIGGSAKQTCCPLLQGLVD

(219) IP- SPPAQPTCPIDALKLGACVDVLGGLIHIGIGGSAKQTCCPLLGGLVD 

(109) DT L KLGAC VDLLGGLVHI G I GS S AKDTCCP VLQGL VD 

(202) GGGGGGKQPTCPINALKLGACVDVLGGLIHIGLGNPVENVCCPVLQGLLE 
(301) P PPAQPTCSIDALKLGACVDVLGGLIHIGIGGSAKQTCCPLLQGLVD 

351 397 

(3 08) LDAAVCLCTTIRLKLLNINLVIPLALQVLID-CGKTPPEGFKCPSS- 

(337) LDAAI CLCTTIRLKLLNINLVIPLALQVLID- CGKTPPEGFKCPAY- 

(205 ) LDAAI CLCTTIRIiKLLNINLVIPLALQVLID- CGKTPPEGFKCPAS- 

(268) LDAAI CLCTTIRLKLLNINIILPIALQVLIDDCGKYPPKDFKCPST- 

(146) LDAAVCIiCTAIKVKLLNVNIIIPIALQVLVG-CGKTPPSGFQCPA- - 

(252) LEAAVCLCTTIRLKLLNLNIFIPLALQALIT-CGINPPSGFVCPPLT 

(351) LDAAI CLCTTI RLKLLNINI VI PLALQ VL I D CGKTPPEGFKCPAS 
551 600
B (551) AAAAACTAAAAAATAATTTCTCTCCTGATTTATATGAAATGACATTTTTT 

A (1) CGCAATTTTTT 

Consensus (551) ■ q ATTTTTT 

601 650 
B (601) TGGAACATGAAGG - GTATTGATTTTTACCACCTTTTACACCT - - TTCAAA 

A ( 12 ) GTGAAGCTGAGGGAGGATTGGATTTTACACCTATTCAAAAGTCATTCAAA 

Consensus (601) GAA TGA GG G ATTG TTTTAC C TT A A T TTCAAA 

651 700 

B (648) G CCATTCAAGGATGAATATAGATTTTTGGGCGATCAAACAC 

A (62) GTTTGTCCCTCCATTCAAGGATGAATGTAGATTTTTCAAGCATCAAACAC 

Consensus (651) G CCATTCAAGGATGAAT TAGATTTTT ATCAAACAC 

701 750 
B (689) AAGAATCATTACGATAACATGCTTTGGAACACACACATGCTTAAATTAAT 

A (112) AAGAATCACTAGCATAACATGCTTTGAAACCCACACA- - CTTAAATTAAT 

Consensus (701) AAGAATCA TA ATAACATGCTTTG AAC CACACA CTTAAATTAAT 

751 800 

B (739) GGTTGGAGTATCAAAT TTTAAAAT- ATTGTTGTCAAT- ACATACCC 

'A (160) GTTAGGAATATCAAATCCAATATAAAATCATAGTTGTCAATTACATACTC 

Consensus (751) G T GGA TATCAAAT T TAAAAT AT GTTGTCAAT ACATAC C 

801 850 
B (783) CGTCAATCTTCTTTTTTTTACCCAATAAACATTGAAATGTTGCTTCTTTC 

A (210) AATCAAGTCCCTTTCTTTTACCCAATAAACATCAACATATTGCTTCTTCC 

Consensus (801) TCAA CTTT TTTTACCCAATAAACAT A AT TTGCTTCTT C 

851 900 

B ( 833 ) GTTAAG CATAAAAAC ATC AAAGTCTA GCAAAATGTTGTTTTTGC 

A (260) ATTAAGCATATAAACATCAAAGTCTAAAACTAGCAAAATGTTGTTTTTAG 

Consensus (851) TTAAGCATA AAACATCAAAGTCTA GCAAAATGTTGTTTTT 

901 950 
B (877) GATGACACATTTCATA - - TAGTTTAAAGGATGCATGATTCGATTACAAAA 

A (310) GATGACACATTTCATACATAGTTTAAAAGATACTTGATTCGATTACAAAA 

Consensus (901) GATGACACATTTCATA TAGTTTAAA GAT C TGATTCGATTACAAAA 

951 1000 
B (925) ACAAAATACTAATAATTCTAGCACAAAGTTTAAAGCAAGATTATAAAGCT 

A (360) AGAAATTACCAATAGTT - TAGCACAAAGT CTAAAGCATAATTA- - AAGCA 

Consensus (951) A AAA TAC AATA TT TAGCACAAAGT TAAAGCA ATTA AAGC 
1001 1050 
B (975) TCATAGCATGTGGATATTCATTTAGAAATATAG ATTA - GATTG CCCCTTT 

A (407) TCA CATGTGCAGATTTAT GAAAAAAAGATTAAGATTGCCCCTTT 

Consensus (1001) TCA CATGTG A ATT AT GAAA A AGATTA GATTGCCCCTTT 
1051 1100 

B (1024) CATCACGGGTC TAACAGCACCACTTGTCACTACATGTCAAAAA- - TG 

A (451) CATCACGGGTCGAATAATAGCACTACTTGTCACTACATGTTAAAAAAATG 

Consensus . (1051) CATCACGGGTC TAA AGCAC ACTTGTCACTACATGT AAAAA TG 

1101 1150 
B (1069) TCCTCTAGTACAGCACCGCTTTTTACTTGATTCCCCTTGTCCATGCATGA 

A (501) TCCTCTAGTACATCAAACTTTTTCCATTGATTCCCCTTATCC ATGA 

Consensus (1101) TCCTCTAGTACA CA TTTT TTGATTCCCCTT TCC ATGA 

1151 1200 
B (1119) AAAAAATCAAAACAATATTTGGACACACAAACTTGCCCCCACTTTCCTTT 

A (547) AAAAAATAAACAAATTCTTAAGACACAAAAAAATGGCCCCACAT - CCTTT 

Consensus (1151) AAAAAAT AA A A T TT GACACA AAA TG CCCCAC T CCTTT 
1201 1250
B (1169) TTCTTTCTGCCCTAGTTTGTTTGAGACTCATATTGATCAAATTTGGCTAT 

A (596) TTTCTGGCCTAGTTTGTTTGA 

Consensus (1201) TTTCTG CCTAGTTTGTTTGA 

1251 1300 

B (1219) GAATTCAAACAAAAAATTCACTCTACCCATTGCATGTGT GGGGCCCA 

A (617) A TTCATTCTAACTCTTGAATATGTAACGAGGCCCA 

Consensus (1251) A TTCA TCTA C TTG AT TGT G GGCCCA 

1301 1350 
B (1266) CATATAAATCCATGAAGGATTTCAATGTCCATCCAAGTCAATGATTCAAC 

A ' (652) C - TAAAAATC AAT CAATGATTTAAC 

Consensus (1301) C TA AAATC AT CAATGATT AAC 

1351 1400 
B (1316) ATATATAACATTGAATAATTTAATTCCAATTTGCAGTATTATGATTTAGA 

A (676) ATAAAAAA TGAATAGTTTAATTCCAATTTGC 

Consensus (13 51) ATA A AA TGAATA TTTAATTCCAATTTGC 

1401 1450 
B (13 66) TTGATTGCTGCAATACGGTCCGTGAATGTGATCACTCACGAGAAAGAGGT 

A (707) TGCAACATGGTCCGTGAATATGA CTCACGAGAAAGATAT 

Consensus (14 01) TGCAA A GGTCCGTGAAT TGA CTCACGAGAAAGA T 

1451 1500 
B (1416) ATCAAAATTTCAAGGTATTTTATTTATTTTTAACAAATAAAATTTCAAGG 

A (746) ATCAAAATATCAA AATTTCATAG 

Consensus (1451) ATCAAAAT TCAA AATTTCA G 

1501 1550 
B (1466) TCTTGTTCACCATATAAACCTCCTCACTCACACCCAATTCTCTTAAGTGT 

A (769) TTTTTTTCACCATATAAACCTCATCACTCATTC - - TATTTTTTTAAGTGC 

Consensus (1501) T TT TTCACCATATAAACCTC TCACTCA C ATT T TTAAGTG 

1551 1600 
B ( 1516 ) ATGACTTCATAGTAC - -ACTACACTACTTTCTTTGAAACATGGCTAACTA 

A (817) AAAG CTTCATAGTAGTGAG C ACACAC ATTACAC TAAAATCTT CGAAAC TT 

Consensus (1551) A CTTCATAGTA A ACAC TT C T AAA T AACT 

1601 1650 
B (1564) TGCTCTAGCCAATGTTTTCATCCTTCTCTTGAACTTGAGTACCTTACTCA 

A (867) A 

Consensus (1601) 
SEQ ID NO: 9 (1) CTTTTCAACAATCATGCCCATGTCAAGTGTAAAACAGGTTTACCTCT

SEQ ID NO: 8 (1) AAGCTTTTCAACAATCATGCCCATGTCAAGTGTAAAACAGGTTTACCTCT 
SEQ ID NO: 7 (1) AAGCTTTTCAACAATCATGCCCATGTCAAGTGTAAAACAGGTTTACCTCT 
Consensus ( 1 ) AAGCTTTTCAACAATCATGCCCATGTCAAGTGTAAAACAGGTTTACCTCT 

51 100 
SEQ ID NO: 9 (48) CTTAAATAACCGTATTAAAATGCTGAATGATGTATATATGTGGGTTCAAA 
SEQ ID NO: 8 (51) CTTAAATAACCGTATTAAAATGCTGAATGATGTATATATGTGGGTTCAAA 
SEQ ID NO: 7 (51) CTTAAATAACCGTATTAAAATGCTGAATGATGTATATATGTGGGTTCAAA 
Consensus (51) CTTAAATAACCGTATTAAAATGCTGAATGATGTATATATGTGGGTTCAAA 

101 150 
SEQ ID NO: 9 (98) TTACATAATTTGTAAGTATGTTACACATTGTATAAATATGTTTTAGAGAA 
SEQ ID NO: 8(101) TTACATAATTTGTAAGTATGTTACACATTGTATAAATATGTTTTAGAGAA 
SEQ ID NO: 7(101) TTACATAATTTGTAAGTATGTTACACATTGTATAAATATGTTTTAGAGAA 
Cons ensus (101) TTACATAATTTGTAAGTATGTTACACATTGTATAAATATGTTTTAGAGAA 

151 200 
SEQ ID NO: 9(148) AAATGTAAACTTATATGTCTAAAGTTATAAAAGAAACATGTCCAACACAT 
SEQ ID NO: 8(151) AAATGTAAACTTATATGTCTAAAGTTATAAAAGAAACATGTCCAACACAT 
SEQ ID NO: 7(151) AAATGTAAACTTATATGTCTAAAGTTATAAAAGAAACATGTCCAACACAT 
Cons ensus (151) AAATGTAAACTTATATGTCTAAAGTTATAAAAGAAACATGTCCAACACAT 

201 250 
SEQ ID NO: 9(198) TTCAGTTAAGATTTAAATAGTATAAATTAAAAATTATCGATGATGACAAA 
SEQ ID NO: 8(201) TTCAGTTAAGATTTAAATAGTATAAATTAAAAATTATCGATGATGACAAA 
SEQ ID NO: 7(201) TTCAGTTAAGATTTAAATAGTATAA - TTAAAAATTATCGATGATGACAAA 
Cons ensus (201) TTCAGTTAAGATTTAAATAGTATAAATTAAAAATTATCGATGATGACAAA 

251 300 
SEQ ID NO: 9 (248) AAATTGTAAATATAATTCATTTTAAAAAAAGTTAAGAAATTGAAAAAGGA 
SEQ ID NO: 8(251) AAATTGTAAATATAATTCATTTTAAAAAAAGTTAAGAAATTGAAAAAGGA 
SEQ ID NO: 7(250) AAATTGTAAATATAATTCATTTTAAAAAAAGTTAAGAAATTGAAAAAGGA 
Consensus (251) AAATTGTAAATATAATTCATTTTAAAAAAAGTTAAGAAATTGAAAAAGGA 

301 350 
SEQ ID NO: 9(298) AATATCGAGAAAAAAATATGTCGATTATATATATGTGTGAGCTGAGTGAA 
SEQ ID NO: 8(301) AATATCGAGAAAAAAATATGTCGATTATATATATGTGTGAGCTGAGTGAA 
SEQ ID NO: 7(300) AATATCGAGAAAAAAATATGTCGATTATATATATGTGTGAGCTGAGTGAA 
Consensus (301) AATATCGAGAAAAAAATATGTCGATTATATATATGTGTGAGCTGAGTGAA 

351 400 
SEQ ID NO: 9(348) TATATATGTATATTTTATTTTTGACTGAATATATGTGTGTATAGACAATA 
SEQ ID NO: 8(351) TATATATGTATATTTTATTTTTGACTGAATATATGTGTGTATAGACAATA 
SEQ ID NO: 7(350) TATATATGTATATTTTATTTTTGACTGAATATATGTGTGTATAGACAATA 
Consensus (351) TATATATGTATATTTTATTTTTGACTGAATATATGTGTGTATAGACAATA 

401 450 
SEQ ID NO: 9 (398) ATGCGCAGAATGCCGATCGATGAATTGTTTACTGCATTTCCAAATATGTG 
SEQ ID NO: 8(401) ATGCGCAGAATGCCGATCGATGAATTGTTTACTGCATTTCCAAATATGTG 
SEQ ID NO: 7(400) ATGCGCAGAATGCCGATCGATGAATTGTTTACTGCATTTCCAAATATGTG 
Consensus (401) ATGCGCAGAATGCCGATCGATGAATTGTTTACTGCATTTCCAAATATGTG 

451 500 
SEQ ID NO: 9(448) TGCATAAGCGTTCCACATGTCACCCATGTTGTAATTAGTTTCTTCCCTGG 
SEQ ID NO: 8(451) TGCATAAGCGTTCCACATGTCACCCATGTTGTAATTAGTTTCTTCCCTGG 
SEQ ID NO: 7(450) TGCATAAGCGTTCCACATGTCACCCATGTTGTAATTAGTTTCTTCCCTGG 
Consensus (451) TGCATAAGCGTTCCACATGTCACCCATGTTGTAATTAGTTTCTTCCCTGG 

501 550 
SEQ ID NO: 9 (498) ATGAATTACTAAGAAACAGATTGATTGATAGTACTATATTAAATTATGTA
SEQ ID NO: 8 (501) ATGAATTACTAAGAAACAGATTGATTGATAGTACTATATTAAATTATGTA
SEQ ID NO: 7 (500) ATGAATTACTAAGAAACAGATTGATTGATAGTACTATATTAAATTATGTA
Consensus (501) ATGAATTACTAAGAAACAGATTGATTGATAGTACTATATTAAATTATGTA

551 600
SEQ ID NO: 9 (548) GCTTTACATGTCAGGAAAATGTAGTTGCAGTATTATGTAATGTAATTAAT
SEQ ID NO: 8 (551) GCTTTACATGTCAGGAAAATGTAGTTGCAGTATTATGTAATGTAATTAAT
SEQ ID NO: 7 (550) GCTTTACATGTCAGGAAAATGTAGTTGCAGTATTATGTAATGTAATTAAT
Consensus (551) GCTTTACATGTCAGGAAAATGTAGTTGCAGTATTATGTAATGTAATTAAT

601 650
SEQ ID NO: 9 (598) AGGAAGTCACAGACAATTTGAAGACAATTTCTTTAGCTTACCTATCTCAT
SEQ ID NO: 8 (601) AGGAAGTCACAGACAATTTGAAGACAATTTCTTTAGCTTACCTATCTCAT
SEQ ID NO: 7 (600) AGGAAGTCACAGACAATTTGAAGACAATTTCXTTAGCTTACCTATCTCAT
Consensus (601) AGGAAGTCACAGACAATTTGAAGACAATTTCTTTAGCTTACCTATCTCAT

651 700
SEQ ID NO: 9 (648) GCCACAATTATGTACTTACGACAGTAAAATGTTTAAAAGCAAAA
SEQ ID NO: 8 (651) GCCACAATTATGTACTTACGACAGTAAAATGTTTAAAAGCAAAA
SEQ ID NO: 7 (650) GCCACAATTATGTACTTACGACAGTAAAATGTTTAAAAGCAAAAGCAAAA
Consensus (651) GCCACAATTATGTACTTACGACAGTAAAATGTTTAAAAGCAAAA

701 750
SEQ ID NO: 9 (692) AAAAGAAAGAAGAAGAAGAAGTAATAAATGGAATTATATAGAATGTACTC
SEQ ID NO: 8 (695) AAAAGAAAGAAGAAGAAGAAGTAATAAATGGAATTATATAGAATGTACTC
SEQ ID NO: 7 (700) AAAAGAAAGAAGAAGAAGAAGTAATAAATGGAATTATATAGAATGTACTC
Consensus (701) AAAAGAAAGAAGAAGAAGAAGTAATAAATGGAATTATATAGAATGTACTC

751 800
SEQ ID NO: 9 (742) TTTGTCTTCATCTGCCCTATAATTCCTGCAGCAGCCAAAGCATAATAGCA
SEQ ID NO: 8 (745) TTTGTCTTCATCTGCCCTATAATTCCTGCAGCAGCCAAAGCATAATAGCA
SEQ ID NO: 7 (750) TTTGTCTTCATCTGCCCTATAATTCCTGCAGCAGCCAAAGCATAATAGCA
Consensus (751) TTTGTCTTCATCTGCCCTATAATTCCTGCAGCAGCCAAAGCATAATAGCA

801 850
SEQ ID NO: 9 (792) TGCAATATGCACATATTCGTTTTAGGCTTTTAGCCTCCACGATCTGTTAA
SEQ ID NO: 8 (795) TGCAATATGCACATATTCGTTTTAGGCTTTTAGCCTCCACGATCTGTTAA
SEQ ID NO: 7 (800) TGCAATATGCACATATTCGTTTTAGGCTTTTAGC-TCCACGATCTGTTAA
Consensus (801) TGCAATATGCACATATTCGTTTTAGGCTTTTAGCCTCCACGATCTGTTAA

851 900
SEQ ID NO: 9 (842) TGGAAAGTGAAAAGTAAGAGATATGAAGTTCATTATGGCAGCCATGGTCC
SEQ ID NO: 8 (845) TGGAAAGTGAAAAGTAAGAGATATGAAGTTCATTATGGCAGCCATGGTCC
SEQ ID NO: 7 (849) TGGAAAGTGAAAAGTAAGAGATATGAAGTTCATTATGGCAGCCATGGTCC
Consensus (851) TGGAAAGTGAAAAGTAAGAGATATGAAGTTCATTATGGCAGCCATGGTCC

901 950
SEQ ID NO: 9 (892) CAGGGAAGCACTAGAAGATATGAAATGACATAAAAGGTCACCATGCATAA
SEQ ID NO: 8 (895) CAGGGAAGCACTAGAAGATATGAAATGACATAAAAGGTCACCATGCATAA
SEQ ID NO: 7 (899) CAGGGAAGCACTAGAAGATATGAAATGAC-TAAAAGGTCACCATGCATAA
Consensus (901) CAGGGAAGCACTAGAAGATATGAAATGACATAAAAGGTCACCATGCATAA

951 1000
SEQ ID NO: 9 (942) TGCTTTAAATGCTTGCTATAGAATCAAAAAATGAAGAGATGTGACAAATT
SEQ ID NO: 8 (945) TGCTTTAAATGCTTGCTATAGAATCAAAAAATGAAGAGATGTGACAAATT
SEQ ID NO: 7 (948) TGCTTTAAATGCTTGCTATAGAATCAAAAAATGAAGAGATGTGACAAATT
Consensus (951) TGCTTTAAATGCTTGCTATAGAATCAAAAAATGAAGAGATGTGACAAATT

1001 1050
SEQ ID NO: 9 (992) GTTACATCTAATACGCAATAATTTGACAAAGACGACTATGCGTTTATATA
SEQ ID NO: 8 (995) GTTACATCTAATACGCAATAATTTGACAAAGACGACTATGCGTTTATATA
SEQ ID NO: 7 (998) GTTACATCTAATACGCAATAATTTGACAAAGACGACTATGCGTTTATATA
Consensus (1001) GTTACATCTAATACGCAATAATTTGACAAAGACGACTATGCGTTTATATA

1051 1100
SEQ ID NO: 9 (1042) TTTATTTTAATTAGTTGGCGTCTCTTATTATAAAGAAAATAAGGGCAGTG
SEQ ID NO: 8 (1045) TTTATTTTAATTAGTTGGCGTCTCTTATTATAAAGAAAATAAGGGCAGTG
SEQ ID NO: 7 (1048) TTTATTTTAATTAGTTGGCGTCTCTTATTATAAAGAAAATAAGGGCAGTG
Consensus (1051) TTTATTTTAATTAGTTGGCGTCTCTTATTATAAAGAAAATAAGGGCAGTG

1101 1150
SEQ ID NO: 9 (1092) TCAACATTTCCAGGCAACTAGTTAGTTATTTTATTTTCTTGTTTATAATT
SEQ ID NO: 8 (1095) TCAACATTTCCAGGCAACTAGTTAGTTATTTTATTTTCTTGTTTATAATT
SEQ ID NO: 7 (1098) TCAACATTTCCAGGCAACTAGTTAGTTATTTTATTTTCTTGTTTATAATT
Consensus (1101) TCAACATTTCCAGGCAACTAGTTAGTTATTTTATTTTCTTGTTTATAATT

1151 1200
SEQ ID NO: 9 (1142) ATTTCCATATAGCTAGCTGTCTCTATCTAATCCAAATCCGCTTTCCACAA
SEQ ID NO: 8 (1145) ATTTCCATATAGCTAGCTGTCTCTATCTAATCCAAATCCGCTTTCCACAA
SEQ ID NO: 7 (1148) ATTTCCATATAGCTAGCTGTCTCTATCTAATCCAAATCCGCGTTCCACAA
Consensus (1151) ATTTCCATATAGCTAGCTGTCTCTATCTAATCCAAATCCGCTTTCCACAA

1201 1250
SEQ ID NO: 9 (1192) CCAACTTGGTCGCATTGGTCCAAAAAACTCAATATCAATATTTTCGAAAT
SEQ ID NO: 8 (1195) CCAACTTGGTCGCATTGGTCCAAAAAACTCAATATCAATATTTTCGAAAT
SEQ ID NO: 7 (1198) CCAACTTGGT---------CCAAAAAACTCAATATCAATATTTTCAAAAT
Consensus (1201) CCAACTTGGTCGCATTGGTCCAAAAAACTCAATATCAATATTTTCGAAAT

1251 1300
SEQ ID NO: 9 (1242) AGTTTTAGCATTGTTTAGGAAGAGAATTGTAAGAGATAAAATCTAAGTAC
SEQ ID NO: 8 (1245) AGTTTTAGCATTGTTTAGGAAGAGAATTGTAAGAGATAAAATCTAAGTAC
SEQ ID NO: 7 (1239) AGTTTTAGCATTGTTTAGGAAGAGAATTGTAAGAGATAAAATCTAAGTAC
Consensus (1251) AGTTTTAGCATTGTTTAGGAAGAGAATTGTAAGAGATAAAATCTAAGTAC

1301 1350
SEQ ID NO: 9 (1292) TCCACCTACCAAGATAAAATAGTTGGATAAATGGGTAAAAAAAGTTGTAT
SEQ ID NO: 8 (1295) TCCACCTACCAAGATAAAATAGTTGGATAAATGGGTAAAAAAAGTTGTAT
SEQ ID NO: 7 (1289) TCCACCTACCAAGATAAAATAGTTGGATAAATGGGTAAAAAA-GTTGTAT
Consensus (1301) TCCACCTACCAAGATAAAATAGTTGGATAAATGGGTAAAAAAAGTTGTAT

1351 1393
SEQ ID NO: 9 (1342) AAAGGGCAACACTACCTCTCCTAATGGCAGTA
SEQ ID NO: 8 (1345) AAAGGGC^ACACTACCTCTCCTAATGGCAGTACCAAAACCCAAG
SEQ ID NO: 7 (1338) AAAGGGCAACACTACCTCTCCTAATGGCAGTACCAAAACCCAAG
Consensus (1351) AAAGGGCAACACTACCTCTCCTAATGGCAGTACCAAAACCCAAG
SEQUENCE LISTING

<110> BASF Plant Science GmbH

<120> Transgenic expression constructs for vegetative plant tissue specific expression of nucleic acids

<130> PF55368-2 / AE20040055

<160> 11

<170> PatentIn version 3.1

<210> 1 

<211> 863 

<212> DNA 

<213> Pisum sativum 

<220> 

<221> promoter 

<222> (1)..(863) 

<223> promoter region of ptxA gene including 5'-untranslated region

<220>

<221> TATA_signal

<222> (549)..(554)

<223>

<220>

<221> 5'UTR

<222> (584)..(863)

<223>

<220>

<221> misc_feature

<222> (300)..(583)

<223> potential core region of the promoter comprising clusters of promoter elements



<400> 1

gcaatttttt gtgaagctga gggaggattg gattttacac ctattcaaaa gtcattcaaa 60

gtttgtccct ccattcaagg atgaatgtag atttttcaag catcaaacac aagaatcact 120

agcataacat gctttgaaac ccacacactt aaattaatgt taggaatatc aaatccaata 180

taaaatcata gttgtcaatt acatactcaa tcaagtccct ttcttttacc caataaacat 240

caacatattg cttcttccat taagcatata aacatcaaag tctaaaacta gcaaaatgtt 300

gtttttagga tgacacattt catacatagt ttaaaagata cttgattcga ttacaaaaag 360

aaattaccaa tagtttagca caaagtctaa agcataatta aagcatcaca tgtgcagatt 420

tatgaaaaaa agattaagat tgcccctttc atcacgggtc gaataatagc actacttgtc 480

actacatgtt aaaaaaatgt cctctagtac atcaaacttt ttccattgat tccccttatc 540

catgaaaaaa ataaacaaat tcttaagaca caaaaaaatg gccccacatc cttttttctg 600

gcctagtttg tttgaattca ttctaactct tgaatatgta acgaggccca ctaaaaatca 660

atcaatgatt taacataaaa aatgaatagt ttaattccaa tttgctgcaa catggtccgt 720

gaatatgact cacgagaaag atatatcaaa atatcaaaat ttcatagttt ttttcaccat 780

ataaacctca tcactcattc tattttttta agtgcaaagc ttcatagtag tgagcacaca 840

cattacacta aaatcttcga aac 863



<210> 2

<211> 1380

<212> DNA

<213> Glycine max

<220>

<221> promoter

<222> (1)..(1380)

<223> promoter region of SbHRGP3 gene including 5' untranslated region

<220>

<221> misc_feature

<222> (800)..(1179)

<223> potential core region of the promoter comprising clusters of promoter elements

<220>

<221> 5'UTR

<222> (1180)..(1380)

<223> potential 5'UTR

<220>

<221> TATA_signal

<222> (1147)..(1152)
<223> 



<400> 2 

tagaaagctt ttcaacaatc atgcccatgt caagtgtaaa acaggtttac ctctcttaaa 60 

taaccgtatt aaaatgctga atgatgtata tatgtgggtt caaattacat aatttgtaag 120 

tatgttacac attgtataaa tatgttttag agaaaaatgt aaacttatat gtctaaagtt 180 

ataaaagaaa catgtccaac acatttcagt taagatttaa atagtataaa ttaaaaatta 240

tcgatgatga caaaaaattg taaatataat tcattttaaa aaaagttaag aaattgaaaa 300

aggaaatatc gagaaaaaaa tatgtcgatt atatatatgt gtgagctgag tgaatatata 360

tgtatatttt atttttgact gaatatatgt gtgtatagac aataatgcgc agaatgccga 420

tcgatgaatt gtttactgca tttccaaata tgtgtgcata agcgttccac atgtcaccca 480

tgttgtaatt agtttcttcc ctggatgaat tactaagaaa cagattgatt gatagtacta 540

tattaaatta tgtagcttta catgtcagga aaatgtagtt gcagtattat gtaatgtaat 600

taataggaag tcacagacaa tttgaagaca atttctttag cttacctatc tcatgccaca 660

attatgtact tacgacagta aaatgtttaa aagcaaaaaa aagaaagaag aagaagaagt 720

aataaatgga attatataga atgtactctt tgtcttcatc tgccctataa ttcctgcagc 780

agccaaagca taatagcatg caatatgcac atattcgttt taggctttta gcctccacga 840

tctgttaatg gaaagtgaaa agtaagagat atgaagttca ttatggcagc catggtccca 900

gggaagcact agaagatatg aaatgacata aaaggtcacc atgcataatg ctttaaatgc 960

ttgctataga atcaaaaaat gaagagatgt gacaaattgt tacatctaat acgcaataat 1020

ttgacaaaga cgactatgcg tttatatatt tattttaatt agttggcgtc tcttattata 1080

aagaaaataa gggcagtgtc aacatttcca ggcaactagt tagttatttt attttcttgt 1140

ttataattat ttccatatag ctagctgtct ctatctaatc caaatccgct ttccacaacc 1200

aacttggtcg cattggtcca aaaaactcaa tatcaatatt ttcgaaatag ttttagcatt 1260

gtttaggaag agaattgtaa gagataaaat ctaagtactc cacctaccaa gataaaatag 1320

ttggataaat gggtaaaaaa agttgtataa agggcaacac tacctctcct aatggcagta 1380 

<210> 3
<211> 26
<212> DNA
<213> Oligonucleotide primer ptxA5'

<400> 3

ggcgcgcccg caattttttg tgaagc 26

<210> 4
<211> 25
<212> DNA
<213> Oligonucleotide primer ptxA3'

<400> 4

tctagataag tttcgaagat tttag 25

<210> 5
<211> 29
<212> DNA
<213> Oligonucleotide primer SbHRGP3-5'

<400> 5

tctagataga agcttttcaa caatcatgc 29

<210> 6
<211> 24
<212> DNA
<213> Oligonucleotide primer SbHRGP3-3'

<400> 6

agatcttact gccattagga gagg 24


<210> 7 
<211> 1381 
<212> DNA
<213> Glycine max

<220>

<221> TATA_signal

<222> (1146)..(1151)
<223>

<220>

<221> misc_feature

<222> (801)..(1178)

<223> potential core region of promoter

<220>
<221> 5'UTR

<222> (1369)..(1381)
<223>

<220>

<221> promoter

<222> (1)..(1368)
<223> 

<400> 7 

aagcttttca acaatcatgc ccatgtcaag tgtaaaacag gtttacctct cttaaataac 60

cgtattaaaa tgctgaatga tgtatatatg tgggttcaaa ttacataatt tgtaagtatg 120

ttacacattg tataaatatg ttttagagaa aaatgtaaac ttatatgtct aaagttataa 180

aagaaacatg tccaacacat ttcagttaag atttaaatag tataattaaa aattatcgat 240

gatgacaaaa aattgtaaat ataattcatt ttaaaaaaag ttaagaaatt gaaaaaggaa 300

atatcgagaa aaaaatatgt cgattatata tatgtgtgag ctgagtgaat atatatgtat 360

attttatttt tgactgaata tatgtgtgta tagacaataa tgcgcagaat gccgatcgat 420

gaattgttta ctgcatttcc aaatatgtgt gcataagcgt tccacatgtc acccatgttg 480

taattagttt cttccctgga tgaattacta agaaacagat tgattgatag tactatatta 540

aattatgtag ctttacatgt caggaaaatg tagttgcagt attatgtaat gtaattaata 600

ggaagtcaca gacaatttga agacaatttc tttagcttac ctatctcatg ccacaattat 660

gtacttacga cagtaaaatg tttaaaagca aaagcaaaaa aaagaaagaa gaagaagaag 720

taataaatgg aattatatag aatgtactct ttgtcttcat ctgccctata attcctgcag 780

cagccaaagc ataatagcat gcaatatgca catattcgtt ttaggctttt agctccacga 840

tctgttaatg gaaagtgaaa agtaagagat atgaagttca ttatggcagc catggtccca 900

gggaagcact agaagatatg aaatgactaa aaggtcacca tgcataatgc tttaaatgct 960

tgctatagaa tcaaaaaatg aagagatgtg acaaattgtt acatctaata cgcaataatt 1020

tgacaaagac gactatgcgt ttatatattt attttaatta gttggcgtct cttattataa 1080

agaaaataag ggcagtgtca acatttccag gcaactagtt agttatttta ttttcttgtt 1140

tataattatt tccatatagc tagctgtctc tatctaatcc aaatccgcgt tccacaacca 1200

acttggtcca aaaaactcaa tatcaatatt ttcaaaatag ttttagcatt gtttaggaag 1260

agaattgtaa gagataaaat ctaagtactc cacctaccaa gataaaatag ttggataaat 1320

gggtaaaaaa gttgtataaa gggcaacact acctctccta atggcagtac caaaacccaa 1380

g 1381



<210> 8 

<211> 1388 

<212> DNA

<213> Glycine max

<220>

<221> TATA_signal

<222> (1143)..(1148)
<223>

<220>

<221> 5'UTR
<222> (1176)..(1388)
<223>

<220>

<221> promoter

<222> (1)..(1175)

<223> potential promoter region

<220>

<221> misc_feature

<222> (796)..(1175)

<223> potential core region of promoter 



<400> 8 

aagcttttca acaatcatgc ccatgtcaag tgtaaaacag gtttacctct cttaaataac 60

cgtattaaaa tgctgaatga tgtatatatg tgggttcaaa ttacataatt tgtaagtatg 120

ttacacattg tataaatatg ttttagagaa aaatgtaaac ttatatgtct aaagttataa 180

aagaaacatg tccaacacat ttcagttaag atttaaatag tataaattaa aaattatcga 240

tgatgacaaa aaattgtaaa tataattcat tttaaaaaaa gttaagaaat tgaaaaagga 300

aatatcgaga aaaaaatatg tcgattatat atatgtgtga gctgagtgaa tatatatgta 360

tattttattt ttgactgaat atatgtgtgt atagacaata atgcgcagaa tgccgatcga 420

tgaattgttt actgcatttc caaatatgtg tgcataagcg ttccacatgt cacccatgtt 480

gtaattagtt tcttccctgg atgaattact aagaaacaga ttgattgata gtactatatt 540

aaattatgta gctttacatg tcaggaaaat gtagttgcag tattatgtaa tgtaattaat 600

aggaagtcac agacaatttg aagacaattt ctttagctta cctatctcat gccacaatta 660

tgtacttacg acagtaaaat gtttaaaagc aaaaaaaaga aagaagaaga agaagtaata 720

aatggaatta tatagaatgt actctttgtc ttcatctgcc ctataattcc tgcagcagcc 780

aaagcataat agcatgcaat atgcacatat tcgttttagg cttttagcct ccacgatctg 840

ttaatggaaa gtgaaaagta agagatatga agttcattat ggcagccatg gtcccaggga 900

agcactagaa gatatgaaat gacataaaag gtcaccatgc ataatgcttt aaatgcttgc 960

tatagaatca aaaaatgaag agatgtgaca aattgttaca tctaatacgc aataatttga 1020

caaagacgac tatgcgttta tatatttatt ttaattagtt ggcgtctctt attataaaga 1080

aaataagggc agtgtcaaca tttccaggca actagttagt tattttattt tcttgtttat 1140

aattatttcc atatagctag ctgtctctat ctaatccaaa tccgctttcc acaaccaact 1200

tggtcgcatt ggtccaaaaa actcaatatc aatattttcg aaatagtttt agcattgttt 1260

aggaagagaa ttgtaagaga taaaatctaa gtactccacc taccaagata aaatagttgg 1320

ataaatgggt aaaaaaagtt gtataaaggg caacactacc tctcctaatg gcagtaccaa 1380

aacccaag 1388



<210> 9 

<211> 1373 

<212> DNA 

<213> Glycine max

<220>

<221> TATA_signal

<222> (1140)..(1145)
<223>

<220>

<221> misc_feature

<222> (793)..(1172)

<223> potential core region of promoter

<220>

<221> 5'UTR
<222> (1173)..(1373)
<223>

<220>

<221> promoter

<222> (1)..(1172)

<223> potential promoter region



<400> 9 

cttttcaaca atcatgccca tgtcaagtgt aaaacaggtt tacctctctt aaataaccgt 60

attaaaatgc tgaatgatgt atatatgtgg gttcaaatta cataatttgt aagtatgtta 120

cacattgtat aaatatgttt tagagaaaaa tgtaaactta tatgtctaaa gttataaaag 180

aaacatgtcc aacacatttc agttaagatt taaatagtat aaattaaaaa ttatcgatga 240

tgacaaaaaa ttgtaaatat aattcatttt aaaaaaagtt aagaaattga aaaaggaaat 300

atcgagaaaa aaatatgtcg attatatata tgtgtgagct gagtgaatat atatgtatat 360

tttatttttg actgaatata tgtgtgtata gacaataatg cgcagaatgc cgatcgatga 420

attgtttact gcatttccaa atatgtgtgc ataagcgttc cacatgtcac ccatgttgta 480

attagtttct tccctggatg aattactaag aaacagattg attgatagta ctatattaaa 540

ttatgtagct ttacatgtca ggaaaatgta gttgcagtat tatgtaatgt aattaatagg 600

aagtcacaga caatttgaag acaatttctt tagcttacct atctcatgcc acaattatgt 660

acttacgaca gtaaaatgtt taaaagcaaa aaaaagaaag aagaagaaga agtaataaat 720

ggaattatat agaatgtact ctttgtcttc atctgcccta taattcctgc agcagccaaa 780

gcataatagc atgcaatatg cacatattcg ttttaggctt ttagcctcca cgatctgtta 840

atggaaagtg aaaagtaaga gatatgaagt tcattatggc agccatggtc ccagggaagc 900

actagaagat atgaaatgac ataaaaggtc accatgcata atgctttaaa tgcttgctat 960

agaatcaaaa aatgaagaga tgtgacaaat tgttacatct aatacgcaat aatttgacaa 1020

agacgactat gcgtttatat atttatttta attagttggc gtctcttatt ataaagaaaa 1080

taagggcagt gtcaacattt ccaggcaact agttagttat tttattttct tgtttataat 1140

tatttccata tagctagctg tctctatcta atccaaatcc gctttccaca accaacttgg 1200

tcgcattggt ccaaaaaact caatatcaat attttcgaaa tagttttagc attgtttagg 1260

aagagaattg taagagataa aatctaagta ctccacctac caagataaaa tagttggata 1320

aatgggtaaa aaaagttgta taaagggcaa cactacctct cctaatggca gta 1373

<210> 10

<211> 1924

<212> DNA

<213> Artificial construct of ptxA promoter and ubiquitin intron

<220>

<221> Intron

<222> (875)..(1924)

<223> Zea mays ubiquitin intron

<220>

<221> misc_feature

<222> (829)..(874)

<223> multiple cloning site

<220>

<221> 5'UTR

<222> (584)..(828)

<223>

<220>

<221> TATA_signal

<222> (549)..(554)
<223>

<220>

<221> promoter

<222> (1)..(583)

<223> potential promoter region

<220>

<221> misc_feature
<222> (300)..(583)

<223> potential core region of promoter

<400> 10

gcaatttttt gtgaagctga gggaggattg gattttacac ctattcaaaa gtcattcaaa 60

gtttgtccct ccattcaagg atgaatgtag atttttcaag catcaaacac aagaatcact 120

agcataacat gctttgaaac ccacacactt aaattaatgt taggaatatc aaatccaata 180

taaaatcata gttgtcaatt acatactcaa tcaagtccct ttcttttacc caataaacat 240

caacatattg cttcttccat taagcatata aacatcaaag tctaaaacta gcaaaatgtt 300

gtttttagga tgacacattt catacatagt ttaaaagata cttgattcga ttacaaaaag 360

aaattaccaa tagtttagca caaagtctaa agcataatta aagcatcaca tgtgcagatt 420

tatgaaaaaa agattaagat tgcccctttc atcacgggtc gaataatagc actacttgtc 480

actacatgtt aaaaaaatgt cctctagtac atcaaacttt ttccattgat tccccttatc 540

catgaaaaaa ataaacaaat tcttaagaca caaaaaaatg gccccacatc cttttttctg 600

gcctagtttg tttgaattca ttctaactct tgaatatgta acgaggccca ctaaaaatca 660

atcaatgatt taacataaaa aatgaatagt ttaattccaa tttgctgcaa catggtccgt 720

gaatatgact cacgagaaag atatatcaaa atatcaaaat ttcatagttt ttttcaccat 780

ataaacctca tcactcattc tattttttta agtgcaaagc ttcatagtta attaaggcgc 840

gccaagcttg catgcctgca ggtcgactct agaggatctc ccccaaatcc acccgtcggc 900

acctccgctt caaggtacgc cgctcgtcct cccccccccc ccctctctac cttctctaga 960

tcggcgttcc ggtccatggt tagggcccgg tagttctact tctgttcatg tttgtgttag 1020

atccgtgttt gtgttagatc cgtgctgcta gcgttcgtac acggatgcga cctgtacgtc 1080

agacacgttc tgattgctaa cttgccagtg tttctctttg gggaatcctg ggatggctct 1140

agccgttccg cagacgggat cgatttcatg attttttttg tttcgttgca tagggtttgg 1200

tttgcccttt tcctttattt caatatatgc cgtgcacttg tttgtcgggt catcttttca 1260

tgcttttttt tgtcttggtt gtgatgatgt ggtctggttg ggcggtcgtt ctagatcgga 1320

gtagaattct gtttcaaact acctggtgga tttattaatt ttggatctgt atgtgtgtgc 1380

catacatatt catagttacg aattgaagat gatggatgga aatatcgatc taggataggt 1440

atacatgttg atgcgggttt tactgatgca tatacagaga tgctttttgt tcgcttggtt 1500

gtgatgatgt ggtgtggttg ggcggtcgtt cattcgttct agatcggagt agaatactgt 1560

ttcaaactac ctggtgtatt tattaatttt ggaactgtat gtgtgtgtca tacatcttca 1620

tagttacgag tttaagatgg atggaaatat cgatctagga taggtataca tgttgatgtg 1680

ggttttactg atgcatatac atgatggcat atgcagcatc tattcatatg ctctaacctt 1740

gagtacctat ctattataat aaacaagtat gttttataat tattttgatc ttgatatact 1800

tggatgatgg catatgcagc agctatatgt ggattttttt agccctgcct tcatacgcta 1860

tttatttgct tggtactgtt tcttttgtcg atgctcaccc tgttgtttgg tgttacttct 1920

gcag 1924



<210> 11 

<211> 23 

<212> DNA

<213> oligonucleotide primer ptxA3'-2

<400> 11

tctagataaa ctatgaagct ttg 23